Description

The present invention relates to fusions of AIDS-related polypeptides. It may provide immunological products
derived from molecular cloning. More particularly, embodiments of this invention relate to immunological polypeptides
useful as diagnostic products and vaccines in the detection of and vaccination against viral etiological agents of acquired
immunodeficiency syndrome.

Analysis of the immune response to a variety of viral infectious agents has been limited by the fact that it has often 
proved difficult to culture such pathogens in quantities sufficient to permit the isolation of viral polypeptides which may 
be used (a) as a diagnostic product to detect an immune response to such pathogens or (b) as a vaccine to confer 
resistance to infection by such pathogens.

The advent of molecular cloning has overcome some of these limitations by providing a means whereby gene 
products from pathogenic agents can be expressed in virtually unlimited quantities in a non -pathogenic form. 

Although antigens from the viruses for influenza (1), foot and mouth disease (2), hepatitis (3), vesicular stomatitis
virus (4) and rabies (5) have been reported to be expressed in E. coli, several laboratories have reported that the
surface antigen for hepatitis B expressed in prokaryotes is not immunologically reactive with antisera to the naturally
occurring antigen (6).

Acquired immunodeficiency syndrome (hereinafter referred to as AIDS) is a devastating disease of the adult immune
system which significantly affects cell-mediated immunity. The disease is manifested by a profound lymphopenia
which appears to be the result of a loss of T-lymphocytes that have the helper/inducer phenotype T4 as defined by the
monoclonal antibody OKT4 (7). Other clinical manifestations include opportunistic infections, predominantly
Pneumocystis carinii pneumonia, and Kaposi's sarcoma (7).

Pre-AIDS, a syndrome that often precedes the onset of AIDS, is characterized by chronic generalized lymphadenopathy.
It has been reported that about 10% of patients with pre-AIDS develop AIDS (8).

The predominant risk group for AIDS includes homosexual and bisexual males, intravenous drug abusers, recipients
of blood components (primarily hemophiliacs receiving treatment with Factor VIII) and recipients of blood transfusions
(7). The epidemiology of this syndrome indicates that AIDS is caused by an infectious agent which is transmitted
by intimate contact or contact with blood or blood products (7).

Recent studies have suggested that a human retrovirus related to the previously described human T-lymphotropic
viruses HTLV-I and HTLV-II may be the causative agent for AIDS (8, 9, 10). Several laboratories have recently isolated
and propagated retroviruses associated with AIDS infection. Gallo has identified a virus designated HTLV-III which has
been isolated from AIDS patients (11) and propagated in a permissive T-cell line (12). Similarly, Luc Montagnier of the
Pasteur Institute in Paris has found a virus in AIDS patients designated LAV (lymphadenopathy associated virus). Some
authors have reported that LAV may be similar or identical to HTLV-III (13, 14). More recently, a virus isolated from an
AIDS patient, designated ARV (AIDS Related Virus), has been identified and cloned (15). These and other AIDS
retroviruses are hereinafter referred to as AIDS associated retrovirus.

HTLV-III is believed to be a retrovirus with a genome comprising approximately 10 kb of RNA. The virus particle
contains a capsid consisting of a number of proteins. The primary core protein is designated p-24 and has a
molecular weight of about 24,000 daltons. The p-24 core protein is synthesized in vivo as part of a precursor
polypeptide encoded by the gag region of the HTLV-III genome. This precursor polypeptide is processed in the infected
cell to form p-24 and other viral proteins. In addition, an envelope protein designated gp-41 (MW 41,000 daltons) is
a constituent of the virus particle and is encoded in the env region of the HTLV-III genome.

Antibodies to the envelope protein gp-41 of HTLV-III have been detected in serum from AIDS and pre-AIDS patients.
Approximately 88% of AIDS and 79% of pre-AIDS patients have detectable antibodies to HTLV-III envelope protein (16).
One author has reported that there is a 90-100% sero-conversion to such antibodies (8). In addition to antibodies to
the gp-41 envelope protein, antibodies to the HTLV-III core protein p-24, and the HTLV-III proteins designated p-55,
p-60 as well as the glycoproteins gp-65, gp-120 and gp-160 have been detected in patients with AIDS (16, 17). The
detection of antibodies to these and other as yet unknown antigens is, therefore, a significant indication of exposure
to or productive infection by the agent responsible for AIDS.

In addition, certain of these viral polypeptide sequences may be used as a vaccine to induce the production of
neutralizing antibodies conferring resistance to AIDS infection. A need exists for variant sequences, such as fusions of
the viral polypeptide with highly immunogenic polypeptides to facilitate induction of a high titer immune response,
as well as deletions of undesired viral sequences.

Accordingly, it is an object herein to express, in a prokaryotic host, viral antigens of an AIDS associated retrovirus
which are non-pathogenic and which may be used as a diagnostic product to detect AIDS.

It is a further object of the present invention to prokaryotically express a diagnostic product to detect AIDS consisting
of variant composite polypeptides of naturally occurring AIDS related polypeptide sequences.

Further, an object of the present invention is to express AIDS associated polypeptides or variants, including fusion
polypeptides of an AIDS associated polypeptide, which may be used as a vaccine against AIDS.

According to the present invention there is provided a process which comprises the preparation of a fusion polypeptide
comprising (a) a first polypeptide sequence of an AIDS associated retrovirus E', env or gag polypeptide having at
least one antigenic determinant capable of specifically binding complementary antibody and (b) a second polypeptide
sequence which is not an AIDS associated virus polypeptide; wherein said E' polypeptide of AIDS-associated retrovirus
is defined as the 206 residue polypeptide designated "E'" in Fig. 2, its naturally-occurring alleles, or its amino acid
sequence variants which are immunologically cross-reactive with antisera capable of binding the E' polypeptide
produced in cells infected with AIDS-associated retrovirus.

The entire genome of an AIDS-associated retrovirus has been cloned and sequenced. We have identified DNA
encoding AIDS-associated polypeptides and employed it in recombinant prokaryotic or mammalian cell culture systems
to synthesize predetermined AIDS-associated polypeptides and derivatives and amino acid sequence variants
thereof. Derivatives of AIDS-associated polypeptides include unglycosylated or variantly glycosylated polypeptides,
as well as formylmethionyl N-terminal species.

Variants of normal AIDS-associated virus polypeptides are provided wherein one or more amino acid residues of
the normal polypeptides have been deleted or substituted by other residues, or one or more amino acid residues
inserted. These variants include predetermined fragments of normal polypeptides (deletion variants), fusion
polypeptides containing an AIDS-associated virus polypeptide or fragment thereof with a second polypeptide which is
not an AIDS-associated virus polypeptide or fragment thereof (fusion or insertional variants), and all of the above in
which an amino acid residue has been substituted.

DNA encoding the polypeptides of this invention is provided, as well as vectors operably incorporating such DNA
and mammalian or prokaryotic cell cultures transformed therewith.

The predetermined polypeptide sequences of an AIDS-associated retrovirus produced herein are essentially free
of other naturally occurring AIDS related polypeptides. These polypeptides contain at least one antigenic determinant
which is capable of specifically binding complementary antibody. Specific polypeptide sequences described herein are
designated p-24, p-15, E', reverse transcriptase and envelope (env) polypeptides. In preferred embodiments, fragments
and/or fusions of predetermined polypeptide sequences of an AIDS associated retrovirus, containing at least one
antigenic determinant capable of specifically binding complementary antibody, are provided. In one species of this
embodiment, a C-terminal portion of the p-24 and the entire p-15 sequence is removed. This truncated p-24, expressed
in E. coli, demonstrates an unexpected reactivity with antibody to an AIDS associated retrovirus. Particularly preferred
fragments are those of the env protein gp120 which are able to bind normal host cell receptors in competition with the
intact virus, as well as E' polypeptide fragments.

In one type of embodiment, the fusion polypeptides comprise (a) a predetermined polypeptide sequence of an
AIDS associated retrovirus or fragment thereof having at least one antigenic determinant capable of specifically binding
complementary antibody and (b) a second polypeptide sequence which is not immunologically reactive with antibodies
normally present in a biologically derived sample which is to be assayed for the presence of antibodies to AIDS
associated retrovirus. If the fusion polypeptide is to be used as a component of a vaccine, the second polypeptide sequence
also should be incapable of inducing antibodies which are cross-reactive with polypeptides which are naturally occurring
in the subject to which such vaccine is directed. In this case, polypeptides from lower eukaryotes other than yeast are
useful.

In one species of a fusion polypeptide, DNA sequences encoding a polypeptide sequence of human growth hormone
(HGH) are positioned five prime to a DNA sequence encoding the p-24 and p-15 polypeptide sequence of HTLV-III.
This is expressed and processed in E. coli as an HGH-p24 fusion polypeptide which specifically binds complementary
antibody. Similarly, fusions of E' and env polypeptides such as gp41 and gp120 with viral, synthetic or prokaryotic
polypeptides are provided. These results demonstrate that viral antigens associated with AIDS may be expressed as
a fusion polypeptide to detect naturally occurring antibodies produced by AIDS infected individuals.

The polypeptide sequences of this invention are formulated into therapeutically effective dosages with a
pharmaceutically acceptable vehicle and administered to AIDS-susceptible animals in order to induce the production of
antibodies to such polypeptide sequences.

In a contemplated embodiment, a polypeptide comprising or consisting of a predetermined polypeptide sequence
of RNA dependent DNA polymerase (reverse transcriptase) from an AIDS associated retrovirus is cloned and expressed
for use in an assay system to identify compounds which inhibit such AIDS associated reverse transcriptase.
Such compounds may be administered as a pharmaceutical agent to inhibit infection by AIDS associated retrovirus or
dissemination of such retrovirus in infected individuals.

Brief Description of the Drawings 

Figure 1 shows the amino terminal polypeptide sequence of p-24 core protein obtained from live HTLV-III retrovirus 
and the corresponding DNA sequence deduced therefrom. 

Figures 2a-2d show the DNA sequence, the putative polypeptide sequence and a partial restriction map of the
HTLV-III genome. The terminal repeat ("R") and unique regions (U5 for 5' and U3 for 3') are shown. The putative
positions assigned to the gag region, reverse transcriptase ("pol"), envelope ("env") and E' and P proteins are shown.
The designations sd and sa are splice donor and splice acceptor sites, respectively. Overlapping amino acid sequences
represent putative mRNA splicing phenomena. Alternative bases indicate differences found among clones, indicative
of variation found among strains in the viral population.

Figure 2e depicts a restriction enzyme map of the provirus DNA and overlapping cDNA clones. 

Figure 2f shows the putative mRNA splicing activity of the virus, the resulting messengers and the proteins thereby 
encoded. 

Figure 3 depicts the strategy employed to subclone p7.11.

Figure 4 depicts the construction of p-24 and truncated p-24 expression vectors.

Figure 5 depicts the construction of expression vectors for a p-24 composite polypeptide and a truncated p-24 
composite polypeptide. 

Figure 6 is a Western blot of the polypeptides expressed by pDE-24, pDE24AHD3, pHGHp-24AB and pHGHp-24AHD3.
The blot was treated with rabbit anti-HTLV-III antibody.

Figure 7 is a Western blot of the polypeptide expressed by pDE-24. The blot was treated with human serum derived
from an individual with AIDS.

Figures 8a, 9a and 10a depict the structure of the plasmids ptrpLE.gp41, ptrpLE.E' and pSVE.E'DHFR. These
plasmids are used for the expression, respectively, of fusions of the env and the E' protein of HTLV-III in bacteria and
of the unfused E' protein in CHO cell culture.

Figures 8b, 9b and 10b show polyacrylamide gel electrophoresis patterns demonstrating the synthesis, respectively,
of the protein fusions encoded by ptrpLE.gp41 and ptrpLE.E' and of the unfused E' protein encoded by pSVE.E'DHFR.

Figure 11a shows the region of the HTLV-III genome used for env protein preparation in recombinant higher eu- 
karyotic cell culture. 

Figure 11b depicts an expression plasmid for secreting an HTLV-III envelope protein from mammalian
cell culture transformants.



Detailed Description 

Applicants have demonstrated that viral protein from an AIDS associated retrovirus can be expressed directly or
as a variant polypeptide in host cells and that such recombinant polypeptides are capable of specifically binding
antibody to an AIDS associated retrovirus. Such variant polypeptides include viral polypeptides fused with one or more
second polypeptide sequences as well as deletions, insertions, substitutions and derivatives of the viral polypeptide.
In addition, directly expressed and certain variant polypeptides, each of which contains fragments of a predetermined
polypeptide sequence of an AIDS associated retrovirus, also react with antibody to AIDS associated retrovirus. These
results indicate that such recombinant polypeptides may be used as diagnostic agents to detect AIDS in individuals,
donated blood and blood products.

Further, such polypeptides may be used as immunogens to induce the production of neutralizing antibodies which 
confer resistance to infection by an AIDS associated retrovirus. 

Still further, the reverse transcriptase of an AIDS associated retrovirus may be used to identify compounds which
may inhibit infection by AIDS associated retrovirus or the dissemination of such retrovirus in infected individuals.

The fusion polypeptides of the present invention comprise an AIDS associated polypeptide sequence and a second
polypeptide sequence. These second polypeptide sequences may be used to: 1) promote secretion of the fusion
polypeptide from a bacterial host into the extra-cellular environment or the periplasm of gram negative bacteria, 2)
facilitate the functional association of the fusion polypeptide with the surface membrane of recombinant host cells or
3) provide a polypeptide sequence which may be used to purify the fusion polypeptide (e.g. purification of an HGH-AIDS
fusion polypeptide by fractionation on an immunoadsorbent specific for HGH). Depending upon the particular
application, second polypeptide sequences used to form a fusion polypeptide with an AIDS associated retrovirus
polypeptide may be of prokaryotic or eukaryotic origin and may be positioned at the amino terminus, carboxy terminus, at both ends of
the AIDS associated polypeptide sequence, or inserted within the AIDS associated polypeptide sequence.
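
The four placements of the second polypeptide named above can be illustrated on plain amino acid strings. The following is a minimal sketch only, not a description of any construct disclosed herein; the function name `fuse`, the placement labels and the toy sequences are hypothetical, and the same second sequence is used at both ends purely for simplicity.

```python
def fuse(viral: str, second: str, placement: str, insert_at: int = 0) -> str:
    """Return a fusion sequence with `second` placed relative to `viral`.

    placement (hypothetical labels):
      "amino"    - second polypeptide at the amino terminus
      "carboxy"  - second polypeptide at the carboxy terminus
      "both"     - second polypeptide at both ends
      "internal" - second polypeptide inserted after residue `insert_at`
    """
    if placement == "amino":
        return second + viral
    if placement == "carboxy":
        return viral + second
    if placement == "both":
        return second + viral + second
    if placement == "internal":
        # The insertion point must lie strictly inside the viral sequence.
        if not 0 < insert_at < len(viral):
            raise ValueError("insert_at must fall inside the viral sequence")
        return viral[:insert_at] + second + viral[insert_at:]
    raise ValueError(f"unknown placement: {placement}")


# Toy placeholder sequences, not real viral or HGH residues.
VIRAL = "VVVV"
SECOND = "SS"

for label in ("amino", "carboxy", "both"):
    print(label, fuse(VIRAL, SECOND, label))
print("internal", fuse(VIRAL, SECOND, "internal", insert_at=2))
```

Representing each placement as an explicit branch keeps the sketch close to the wording of the paragraph; a real construct would of course be assembled at the DNA level with the reading frame preserved.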

Examples of second polypeptide sequences which may be used to promote secretion of the fusion polypeptide
include (1) the signal sequence of Herpes Simplex Virus gD protein disclosed in copending U.S. application 527,917
of 29 August 1983 (EP 139417); (2) the signal sequence of E. coli alkaline phosphatase or E. coli enterotoxin STII
disclosed in copending U.S. application 658,342 of 5 October 1984 (EPA 85307044.9), and references disclosed therein,
and (3) pre-HGH disclosed in copending U.S. application 488,232 of 25 April 1983 (EP 127305) or other higher
eukaryotic signal sequences such as that of gamma interferon.

An example of a second polypeptide sequence which facilitates functional membrane association is the transmembrane
sequence of Herpes Simplex virus disclosed in U.S. application 527,917 filed August 29, 1983 (EP 139417).
Since many individuals at risk for AIDS also have antibodies to E. coli and other enterobacteria, the second polypeptide
must be chosen to avoid false positive immunological reactivity with these antibodies. Polypeptide sequences from
enterobacteria should therefore be used as a second polypeptide only if such sequences are removed during processing
or otherwise prevented from reacting with the biologically derived samples to be assayed for the presence of antibody
produced in response to infection by an AIDS associated virus, e.g. by recombinant expression such that the
bacterial protein epitopes are modified so as to no longer be cross-reactive with the native protein (see the LE fusions
described below).

When the fusion polypeptide of the present invention is used as a vaccine against AIDS infection, the second
polypeptide sequence must be chosen to avoid the production of antibodies to polypeptides which are naturally occurring
in the subject to which such vaccine is directed. For example, in a vaccine for humans the second polypeptide sequence
is preferably not HGH. Such vaccines, however, may contain prokaryotic polypeptide sequences or preferably eukaryotic
polypeptide sequences other than those of yeast and primates.

The present invention specifically discloses the cloning and expression of certain HTLV-III-encoded polypeptides.
However, the present invention also contemplates the cloning and expression of other HTLV-III polypeptides. HTLV-III
polypeptides which possess antigenic determinants to antibodies from AIDS and pre-AIDS patients include gp-160,
gp-120, gp-65, gp-41 and p-60/p-55. The gp-120 polypeptide appears to be a precursor polypeptide for gp-65 and gp-41.
These particular HTLV-III polypeptides are illustrative and are not intended to limit the scope of the invention.

In addition, the present invention contemplates the generation of a library of products, each containing different 
antigenic determinants that may be used to determine which antigenic determinants are best suited for detection of 
AIDS or pre-AIDS. Such a library, for example, may be used to determine which antigenic determinants are
immunologically reactive to serum derived from healthy individuals who are serologically positive for AIDS. Those antigenic
determinants which test positive to such serum but negative to serum from AIDS patients may be prime candidates 
for a vaccine to induce the production of neutralizing antibodies. Further, diagnostic products containing such antigenic 
determinants may be used to identify individuals with neutralizing antibodies who are unlikely to develop the severe 
clinical manifestations associated with AIDS. 
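
The screening logic described above, selecting determinants that react with serum from healthy seropositive individuals but not with serum from AIDS patients, can be sketched as a simple filter. This is an illustrative sketch under stated assumptions; the function name `vaccine_candidates`, the dictionary keys and the example panel are hypothetical and do not represent measured data.

```python
from typing import Dict, List


def vaccine_candidates(reactivity: Dict[str, Dict[str, bool]]) -> List[str]:
    """Return determinants positive with healthy-seropositive serum
    and negative with AIDS-patient serum.

    `reactivity` maps a determinant name to two boolean test results
    (hypothetical keys): "healthy_seropositive" and "aids_patient".
    """
    return sorted(
        name
        for name, result in reactivity.items()
        if result["healthy_seropositive"] and not result["aids_patient"]
    )


# Hypothetical panel for illustration only; not experimental results.
panel = {
    "determinant_A": {"healthy_seropositive": True, "aids_patient": True},
    "determinant_B": {"healthy_seropositive": True, "aids_patient": False},
    "determinant_C": {"healthy_seropositive": False, "aids_patient": False},
}

print(vaccine_candidates(panel))
```

In practice each boolean would stand for an immunoassay read-out against a pool of sera, so the threshold used to call a reaction positive is itself an assumption that would need to be fixed before such a comparison is meaningful.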

Although the present invention is based on studies of HTLV-III, it is to be understood that HTLV-III may be similar
or identical to LAV or ARV. As so related, polypeptide products derived from those retroviruses are within the scope
of the present invention. Accordingly, the designation AIDS associated retrovirus refers to HTLV-III, LAV, ARV and/or 
other retrovirus that may cause AIDS or pre-AIDS. 

As used herein, a polypeptide sequence of an AIDS associated retrovirus is the full length native polypeptide
sequence or the predetermined sequence derived from genomic sequencing.

A naturally occurring (native) polypeptide sequence is the polypeptide formed in virus infected cells or found in 
the culture fluid of such cells. 

Variant polypeptide sequences of an AIDS associated retrovirus include: (1) fusions of viral polypeptide or fragments
thereof with second polypeptide sequences including N and C terminal fusions and insertions; (2) deletions of
the N-terminal, C-terminal, or an internal region of the polypeptide sequence of viral polypeptide to produce a fragment
of a polypeptide sequence of an AIDS associated retrovirus; (3) substitutions of one or more amino acids in a polypeptide
sequence of an AIDS associated retrovirus and (4) derivatives such as labelled or bound viral polypeptide sequences
which may be labelled by well known techniques or bound to a solid phase such as that disclosed in U.S.
patent 3,720,760.

"Second polypeptides" are sequences which are fused with a polypeptide sequence of an AIDS associated retrovirus
or fragment thereof to form the fusion polypeptide sequences of the present invention. These second polypeptides
may be full length or partial protein sequences of eukaryotic, non-AIDS viral or prokaryotic origin and may be used to
promote secretion of the fusion polypeptide, facilitate association of the fusion polypeptide with the surface membrane
of an expression host or aid in the purification of the fusion polypeptide. When used as a vaccine, the second polypeptide
of a fusion polypeptide is a sequence which is not normally capable of inducing antibodies which are cross-reactive
with naturally occurring polypeptides in the subject to which such vaccine is directed.

"Complementary antibody" refers to antibody raised against a corresponding naturally occurring viral epitope or 
epitope encoded by AIDS-associated retrovirus. 

A DNA sequence of an AIDS associated retrovirus encodes the polypeptide and variant polypeptide sequences
of the present invention described above.

"Biologically derived sample" includes any biological fluid or tissue sample taken from a human or animal subject 
which may be assayed to detect the presence of complementary antibody produced in response to exposure to or 
infection by an AIDS associated retrovirus. Such samples typically comprise blood, urine, semen, and saliva but may 
include any biological material in which such complementary antibody or AIDS associated retrovirus may be found. 

Prokaryotes are preferred for cloning and expressing DNA sequences to produce the diagnostic product and vaccine
of the present invention. For example, E. coli K12 strain 294 (ATCC No. 31446) is particularly useful. Other
microbial strains which may be used include E. coli strains such as E. coli B, E. coli X1776 (ATCC No. 31537),
E. coli c600 and c600hfl, and E. coli W3110 (F-, λ-, prototrophic, ATCC No. 27325), bacilli such as Bacillus subtilis,
other enterobacteriaceae such as Salmonella typhimurium or Serratia marcescens, and various Pseudomonas species.
When expressed in prokaryotes the polypeptides of the present invention typically contain an N-terminal methionine
or a formyl methionine, and are not glycosylated. These examples are, of course, intended to be illustrative rather than
limiting.

In general, plasmid vectors containing replication and control sequences which are derived from species compatible
with the host cell are used in connection with these hosts. The vector ordinarily carries a replication site, as well
as sequences which encode proteins that are capable of providing phenotypic selection in transformed cells. For
example, E. coli is typically transformed using pBR322, a plasmid derived from an E. coli species (18). Plasmid pBR322
contains genes for ampicillin and tetracycline resistance and thus provides easy means for identifying and selecting
transformed cells. The pBR322 plasmid, or other microbial plasmid, must also contain, or be modified to contain, promoters
which can be used by the microbial organism for expression of its own proteins. Those promoters most commonly
used in recombinant DNA construction include β-lactamase (penicillinase) and lactose promoter systems (19-21) and
a tryptophan (trp) promoter system (22, 23). While these are the most commonly used, other microbial promoters have
been discovered and utilized, and details concerning their nucleotide sequences have been published, enabling a
skilled worker to ligate them functionally with plasmid vectors (24). In the specific embodiments disclosed, a trp promoter
(22, 23) was used to express the diagnostic product and vaccine of the present invention.

In addition to prokaryotes, eukaryotic cells may be used to express the AIDS associated virus polypeptides, including
particularly the reverse transcriptase of an AIDS associated retrovirus. Saccharomyces cerevisiae, or common
baker's yeast, is the most commonly used among eukaryotic microorganisms, although a number of other strains are
commonly available. For expression in Saccharomyces, the plasmid YRp7, for example, (25-27) is commonly used.
The plasmid already contains the trp1 gene which provides a selection marker for a mutant strain of yeast lacking the
ability to grow in tryptophan, for example ATCC No. 44076 or PEP4-1 (28). The presence of the trp1 lesion as a
characteristic of the yeast host cell genome then provides an effective environment for detecting transformation by growth
in the absence of tryptophan.

Suitable promoting sequences in yeast vectors include the promoters for 3-phosphoglycerate kinase (29) or other
glycolytic enzymes (30, 31), such as enolase, glyceraldehyde-3-phosphate dehydrogenase, hexokinase, pyruvate
decarboxylase, phosphofructokinase, glucose-6-phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase,
triosephosphate isomerase, phosphoglucose isomerase, and glucokinase. In constructing suitable expression plasmids,
the termination sequences associated with these genes are also ligated into the expression vector 3' of the
sequence desired to be expressed to provide polyadenylation of the mRNA and termination. Other promoters, which have
the additional advantage of transcription controlled by growth conditions, are the promoter regions for alcohol
dehydrogenase 2, isocytochrome C, acid phosphatase, degradative enzymes associated with nitrogen metabolism, and
the aforementioned glyceraldehyde-3-phosphate dehydrogenase, and the enzymes responsible for maltose and galactose
utilization. Any plasmid vector containing yeast-compatible promoter, origin of replication and termination
sequences is suitable.

Cultures of cells derived from multicellular organisms also are employed for expression of AIDS associated retrovirus
proteins. Mammalian or vertebrate cells are of particular interest, such as VERO and HELA cells, Chinese Hamster
ovary (CHO) cell lines, and WI38, BHK, COS-7 and MDCK cell lines. Expression vectors for such cells ordinarily include
an origin of replication, a promoter for controlling expression of the DNA encoding the AIDS associated retroviral
polypeptide, along with a mammalian selection marker, RNA splice site, polyadenylation site and transcriptional
terminator sequences as required.

Vectors capable of transforming mammalian host cells to expression of AIDS associated polypeptides are preferably
introduced into host cells with a selection marker, e.g. the gene encoding DHFR (dihydrofolate reductase), in known
fashion and then amplified by exposing the transformants to increasing concentrations of selection agent, e.g.
methotrexate. For example see U.S. patent 4,399,216.

For use in mammalian cells, the transcriptional and translational control functions are conventionally obtained from
viral sources. For example, commonly used promoters are derived from polyoma, Simian Virus 40 (SV40) and most
particularly Adenovirus 2. The early and late promoters of SV40 virus are useful, as is the major late promoter of
adenovirus as described above. Further, it is also possible, and often desirable, to utilize promoter or control sequences
normally associated with the desired gene sequence, provided such control sequences are compatible with the host
cell systems.

An origin of replication may be provided either by construction of the vector to include an exogenous origin, such
as may be derived from adenovirus or other viral (e.g. Polyoma, SV40, VSV, BPV, etc.) source, or may be provided by
the host cell chromosomal replication mechanism, if the vector is integrated into the host cell chromosome.

For vectors of the invention which comprise DNA sequences encoding both AIDS associated polypeptide and a
cotransformation, selection and amplification gene such as the DHFR enzyme, it is appropriate to select the host
according to the type of DHFR protein employed. If wild type DHFR protein is employed, it is preferable to select a
host cell which is deficient in DHFR, thus permitting the use of the DHFR coding sequence as a marker for successful
transfection in selective medium which lacks hypoxanthine, glycine, and thymidine.

On the other hand, if DHFR protein with low binding affinity for MTX is used as the controlling sequence, it is not
necessary to use DHFR resistant cells. Because the mutant DHFR is resistant to methotrexate, MTX containing media
can be used as a means of selection provided that the host cells are themselves methotrexate sensitive.

Alternatively, a wild type DHFR gene may be employed as an amplification marker in a host cell which is not
deficient in DHFR provided that a second drug selectable marker is employed, such as neomycin resistance.
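
The three host and marker combinations described in the preceding paragraphs can be summarized as a small decision function. This is a hedged sketch for orientation only; the function name `selection_strategy`, its parameter names and the returned wording are hypothetical and simply restate the text above.

```python
from typing import Optional


def selection_strategy(
    dhfr_type: str,
    host_dhfr_deficient: bool,
    host_mtx_sensitive: bool = True,
    second_marker: Optional[str] = None,
) -> str:
    """Map a DHFR gene type and host phenotype to a selection scheme.

    dhfr_type (hypothetical labels): "wild_type" or "mtx_resistant_mutant".
    """
    if dhfr_type == "wild_type":
        if host_dhfr_deficient:
            # DHFR-deficient host: the DHFR gene itself is the marker.
            return "medium lacking hypoxanthine, glycine and thymidine"
        if second_marker is not None:
            # DHFR-proficient host: select on the second marker and
            # keep DHFR only as the amplification marker.
            return f"select on {second_marker}; DHFR used for amplification"
        raise ValueError("wild type DHFR in a non-deficient host needs a second marker")
    if dhfr_type == "mtx_resistant_mutant":
        if host_mtx_sensitive:
            return "methotrexate-containing medium"
        raise ValueError("host must be methotrexate sensitive for MTX selection")
    raise ValueError(f"unknown DHFR type: {dhfr_type}")


print(selection_strategy("wild_type", host_dhfr_deficient=True))
print(selection_strategy("mtx_resistant_mutant", host_dhfr_deficient=False))
print(selection_strategy("wild_type", host_dhfr_deficient=False,
                         second_marker="neomycin resistance"))
```

Raising an error for the unsupported combinations makes explicit that the text names a condition for each route, for example methotrexate sensitivity of the host when the mutant DHFR is used.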

An example, which is set forth hereinafter, contemplates the use of CHO cells as host cells and an expression 
vector which encodes the reverse transcriptase of an AIDS associated retrovirus. 

As more fully set out below, the diagnostic product of the present invention is utilized in place of its counterpart
derived from a live pathogen in analogous immunoassays. In that regard, a commercial diagnostic test kit would include
the above diagnostic products with a variety of other immunological products, at least one of which is labeled, for
detection of its complementary antibody or the antigen. The system has been described with respect to the molecular
cloning and expression of specific proteins of HTLV-III which possess sufficient antigenic determinants to render them
capable of specifically binding complementary antibody, namely antibody to HTLV-III. The specific techniques for cloning
and expressing exemplary polypeptides are set forth in more detail in the examples that follow.

There are a number of known techniques for the determination of an unknown quantity of antigen or antibody in 
biological fluids such as serum, urine, or saliva or from skin samples or the like. In principle, the present invention 
utilizes such known techniques but substitutes certain molecularly cloned diagnostic reagents of a type set forth above 
in the otherwise known procedure. 

Accordingly, the procedures themselves will be described only generally with reference being made to conventional 
immunology texts for the details of the procedures. It would be well known to skilled workers in the field how to utilize 
the novel diagnostic products of the present invention in conventional immunological techniques. 

For simplicity of description, the general term "diagnostic product" will be used in describing the antigen functional 
product of the present invention. The term "diagnostic product" is defined as a predetermined polypeptide sequence 
of an AIDS associated retrovirus with one or more antigenic determinants capable of specifically binding complementary 
antibody induced by an AIDS associated retrovirus. The diagnostic product is formed in a recombinant host cell capable 
of its production. The polypeptide sequence may be either functionally associated with a surface membrane of the 
recombinant cell or it may be recovered and used free of the host cell membrane. Further, the antigenic polypeptide 
sequence may be fused to a second polypeptide sequence. 

The diagnostic methods used in assaying AIDS associated retrovirus, its constituent polypeptides and complementary 
antibodies are conventional. These include the competitive, sandwich and steric inhibition techniques. The 
first two methods employ a phase separation step as an integral part of the method while steric inhibition assays are 
conducted in a single reaction mixture. The methodology for assay of retrovirus or its polypeptides on the one hand 
and for substances that bind retrovirus or viral polypeptides on the other hand is essentially the same, although 
certain methods will be favored depending upon the size of the substance being assayed. Therefore the substance to 
be tested is referred to herein as an analyte, irrespective of its status otherwise as an antigen or antibody, and proteins 
which bind to the analyte are denominated binding partners, whether they be antibodies, cell surface receptors or 
antigens. 

Analytical methods for AIDS associated retrovirus, its polypeptides, complementary antibody or cell surface receptors 
all use one or more of the following reagents: labelled analyte analogue, immobilized analyte analogue, labelled 
binding partner, immobilized binding partner and steric conjugates. The labelled reagents also are known as 
"tracers". 

The label used is any detectable functionality which does not interfere with the binding of analyte and its binding 
partner. Numerous labels are known for use in immunoassay, examples including enzymes such as horseradish peroxidase, 
radioisotopes such as 14C and 131I, fluorophores such as rare earth chelates or fluorescein, spin labels and 
the like. Conventional methods are available to covalently bind these labels to proteins or polypeptides. Such bonding 
methods are suitable for use with AIDS associated retrovirus, viral polypeptides, complementary antibody and retrovirus 
receptors, all of which are proteinaceous. 

Immobilization of reagents is required for certain assay methods. Immobilization entails separating the binding 
partner from any analyte which remains free in solution. This conventionally is accomplished by either insolubilizing 
the binding partner or analyte analogue before the assay procedure, such as by adsorption to a water insoluble matrix 
or surface (Bennich et al., U.S. 3,720,760) or by covalent coupling (for example using glutaraldehyde cross-linking), 
or by insolubilizing the partner or analogue afterward, e.g., by immunoprecipitation. 

Steric conjugates are used in the steric hindrance method for homogeneous assay. These conjugates are synthesized 
by covalently linking a low molecular weight hapten to a small analyte so that antibody to hapten substantially 
is unable to bind the conjugate at the same time as anti-analyte. Under this assay procedure the analyte present in 
the test sample will bind anti-analyte, thereby allowing anti-hapten to bind the conjugate, resulting in a change in 
fluorescence when the hapten is a fluorophore. 
Other assay methods, known as competitive or sandwich assays, are well established and widely used in the 
commercial diagnostics industry. 

Competitive assays rely on the ability of a labelled analogue (the "tracer") to compete with the test sample analyte 
for a limited number of binding sites on a common binding partner. The binding partner is generally insolubilized before 
or after the competition and then the tracer and analyte bound to the binding partner are separated from the unbound 
tracer and analyte. This separation is accomplished by decanting (where the binding partner was preinsolubilized) or 
by centrifuging (where the binding partner was precipitated after the competitive reaction). The amount of test sample 
analyte is inversely proportional to the amount of bound tracer as measured by the amount of marker substance. 
Dose-response curves with known amounts of analyte are prepared and compared with the test results in order to 
quantitatively determine the amount of AIDS associated retrovirus, viral polypeptide or complementary antibody present in the 
test sample. These heterogeneous assays are called ELISA systems when enzymes are used as the detectable markers. 
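
The comparison of a test result with a dose-response curve can be illustrated numerically. The following Python sketch is illustrative only: the function name `interpolate_analyte`, the standard amounts and the tracer counts are hypothetical and do not come from the examples herein. It assumes that bound tracer falls monotonically as analyte rises, and it interpolates log10(amount) linearly between the two standards that bracket the test signal.

```python
import math


def interpolate_analyte(standards, bound_signal):
    """Estimate an analyte amount from a competitive-assay standard curve.

    standards: list of (analyte_amount, bound_tracer_signal) pairs with
        distinct signals; signal is assumed to fall as amount rises.
    bound_signal: bound tracer measured for the test sample.
    """
    # Order the standards by ascending bound-tracer signal.
    points = sorted(standards, key=lambda p: p[1])
    for (amt_a, sig_a), (amt_b, sig_b) in zip(points, points[1:]):
        if sig_a <= bound_signal <= sig_b:
            fraction = (bound_signal - sig_a) / (sig_b - sig_a)
            log_a, log_b = math.log10(amt_a), math.log10(amt_b)
            return 10 ** (log_a + fraction * (log_b - log_a))
    raise ValueError("signal lies outside the range of the standard curve")


# Hypothetical standards: (amount in ng, bound tracer in counts per minute).
STANDARDS = [(1, 9000), (10, 6000), (100, 3000), (1000, 1000)]

if __name__ == "__main__":
    # A signal midway between the 10 ng and 100 ng standards.
    print(round(interpolate_analyte(STANDARDS, 4500), 1))
```

A signal outside the range spanned by the standards raises an error rather than being extrapolated, since the curve is only defined where standards were run.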

Another species of competitive assay is a homogeneous assay which does not require a phase separation. Here, 
a conjugate of an enzyme with the analyte is prepared so that when anti-analyte binds to the analyte the presence of 
the anti-analyte modifies the enzyme activity. In this case, a polypeptide of an AIDS associated retrovirus or its 
immunologically active fragments are conjugated with a bifunctional organic bridge to an enzyme such as peroxidase. 
Conjugates are selected for use with complementary antibody so that binding of the complementary antibody inhibits or 
potentiates enzyme activity. This method per se is widely practiced under the name EMIT. 

Sandwich assays particularly are useful for the determination of polypeptides of an AIDS associated retrovirus, 
complementary antibody or retrovirus cell surface receptors, i.e., large molecules. In sequential sandwich assays an 
immobilized binding partner is used to adsorb test sample analyte, the test sample is removed by washing, the bound 
analyte is used to adsorb labelled binding partner and bound material is then separated from residual tracer. The amount 
of bound tracer is directly proportional to test sample analyte. In a "simultaneous" sandwich assay, test sample is not 
separated before adding the labelled binding partner. 

The foregoing are merely exemplary assays for AIDS associated retrovirus, polypeptides of an AIDS associated 
retrovirus, complementary antibody and retrovirus cell surface receptors. Other methods now or hereafter developed 
for the determination of these analytes are included within the scope hereof. 

In order to simplify the examples certain frequently occurring and well-known methods employed in recombinant 
constructions will be referenced by shorthand phrases or designations. 

Plasmids are generally designated by a lower case p preceded and/or followed by capital letters and/or numbers. 
The starting plasmids or sources of DNA herein are commercially available, are publicly available on a restricted basis, 
or can be constructed from available plasmids or polynucleotides in accord with published procedures. In addition, 
other equivalent plasmids are known in the art and will be apparent to the ordinary artisan since the plasmids generally 
only function as replication vehicles for the preprotein and its control sequences, or for elements thereof in intermediate 
constructions. 

"Digestion" of DNA refers to catalytic cleavage of the DNA with an enzyme that acts only at certain locations in 
the DNA. Such enzymes are called restriction enzymes, and the site for which each is specific is called a restriction site. 
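
The notion of an enzyme acting only at certain locations can be illustrated by a short Python sketch. The function names `find_sites` and `digest` are hypothetical, only one strand of a linear molecule is modelled, and the recognition sequence is not treated as a palindrome. The EcoRI recognition sequence GAATTC, cleaved after the first G, is used, and the test sequence is the top strand of the fsl-2 linker described in Example 1.

```python
def find_sites(sequence, site):
    """Return the 0-based start index of every occurrence of a restriction site."""
    seq, site = sequence.upper(), site.upper()
    positions = []
    start = seq.find(site)
    while start != -1:
        positions.append(start)
        start = seq.find(site, start + 1)
    return positions


def digest(sequence, site, cut_offset):
    """Cut a linear top strand at each site, cut_offset bases into the site."""
    cuts = [p + cut_offset for p in find_sites(sequence, site)]
    bounds = [0] + cuts + [len(sequence)]
    return [sequence[a:b] for a, b in zip(bounds, bounds[1:])]


ECORI_SITE = "GAATTC"   # EcoRI cleaves G^AATTC, i.e. one base into the site
FSL2_TOP = "AATTATGGAATTCCAT"

if __name__ == "__main__":
    print(find_sites(FSL2_TOP, ECORI_SITE))
    print(digest(FSL2_TOP, ECORI_SITE, cut_offset=1))
```

In this sequence the site begins immediately after the ATG, so the cut leaves the ATG on the upstream fragment.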

The various restriction enzymes used herein are commercially available and their reaction conditions, cofactors 
and other requirements as established by the enzyme suppliers were used. Restriction enzymes commonly are designated 
by abbreviations composed of a capital letter followed by other letters and then, generally, a number representing 
the microorganism from which each restriction enzyme originally was obtained. In general, about 1 µg of plasmid 
or DNA fragment is used with about 1 unit of enzyme in about 20 µl of buffer solution. Appropriate buffers and substrate 
amounts for particular restriction enzymes are specified by the manufacturer. Incubation times of about 1 hour at 37°C 
are ordinarily used, but may vary in accordance with the supplier's instructions. After incubation, protein is removed 
by extraction with phenol and chloroform, and the digested nucleic acid is recovered from the aqueous fraction by 
precipitation with ethanol. Digestion with a restriction enzyme infrequently is followed with bacterial alkaline phosphatase 
hydrolysis of the terminal 5' phosphates to prevent the two restriction cleaved ends of a DNA fragment from "circularizing" 
or forming a closed loop upon ligation (described below) that would impede insertion of another DNA fragment 
at the restriction site. Unless otherwise stated, digestion of plasmids is not followed by 5' terminal dephosphorylation. 
Procedures and reagents for dephosphorylation are conventional (32). 

"Recovery" or "isolation" of a given fragment of DNA from a restriction digest means separation of the digest by 
polyacrylamide gel electrophoresis, identification of the fragment of interest by comparison of its mobility versus that 
of marker DNA fragments of known molecular weight, removal of the gel section containing the desired fragment, and 
separation of the DNA from the gel, generally by electroelution. This procedure is known generally. 
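
The comparison of a fragment's mobility with that of marker fragments can be sketched as follows. This Python sketch rests on an assumption not stated above, namely that log10(fragment size) is roughly linear in migration distance over the range of the markers, which holds only approximately and over a limited size range. The function names and the marker values are hypothetical.

```python
import math


def fit_semilog(markers):
    """Least-squares fit of log10(size_bp) = slope * mobility + intercept.

    markers: list of (size_bp, mobility) pairs for fragments of known size.
    """
    xs = [mobility for _, mobility in markers]
    ys = [math.log10(size) for size, _ in markers]
    n = len(markers)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def estimate_size(markers, mobility):
    """Estimate the size in base pairs of a band from its migration distance."""
    slope, intercept = fit_semilog(markers)
    return 10 ** (slope * mobility + intercept)


# Hypothetical marker ladder: (size in bp, migration distance in cm).
MARKERS = [(2000, 1.0), (1000, 2.0), (500, 3.0), (250, 4.0)]

if __name__ == "__main__":
    print(round(estimate_size(MARKERS, 2.5)))
```

Bands that migrate outside the distances spanned by the markers are extrapolated by this fit and should be treated with caution.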

A "Western Blot" is a method by which the presence of polypeptide is confirmed by reaction with labelled complementary 
antibody. The polypeptide is separated electrophoretically on a polyacrylamide gel and electrophoretically transferred 
to nitrocellulose. The nitrocellulose is incubated with labelled complementary antibody, unbound antibody is removed 
and the location of residual label is identified. 

"Transformation" means introducing DNA into an organism so that the DNA is replicable, either as an extrachromosomal 
element or chromosomal integrant. Unless otherwise provided, the method used herein is the CaCl2 transformation 
method (33). 

"Ligation" refers to the process of forming phosphodiester bonds between two double stranded nucleic acid fragments 
(34). Unless otherwise provided, ligation may be accomplished using known buffers and conditions with 10 units of T4 
DNA ligase ("ligase") per 0.5 µg of approximately equimolar amounts of the DNA fragments to be ligated. 

"Fill in" refers to repair of sticky ended (overhanging) restriction enzyme fragments. Generally 2-15 µg of DNA are 
incubated in 50 mM NaCl, 10 mM Tris (pH 7.5), 10 mM MgCl2, 1 mM dithiothreitol with 250 mM of each of four 
deoxynucleoside triphosphates and 8 units DNA polymerase Klenow fragment at 24°C for 30 minutes. The reaction is 
terminated by phenol and chloroform extraction and ethanol precipitation. 

"Preparation" of DNA from transformants means isolating plasmid DNA from microbial culture. Unless otherwise 
provided, the alkaline/SDS method was used (34). 

"Oligonucleotides" are short length single or double stranded polydeoxynucleotides which are made by chemically 
known methods and then purified on polyacrylamide gels (35). 

The following specific examples are intended to illustrate more fully the nature of the present invention without 
acting as a limitation upon its scope. 

Example 1 

This example discloses plasmid construction and expression of the p-24 core polypeptide of HTLV-III in a prokaryotic 
expression host. 

A. Virus Growth and Viral RNA Isolation 

HTLV-IIIB/H9 cells (ATCC CRL 8543) were obtained from Dr. Larry Arthur, Frederick Cancer Research Facility, 
Frederick, MD. Cultures were maintained in suspension and adapted to RPMI 1640 supplemented with 5% FBS, 2 
mM L-glutamine, and 100 µg/ml penicillin-streptomycin mixture. Cells were seeded at a density of 3 x 10^5 viable cells/ml 
in 10-liter volumes, and cultures harvested at a density of 10-15 x 10^5 viable cells/ml approximately 96 hours after 
seeding. 

Following clarification to remove cells, culture fluids were concentrated by ultrafiltration. HTLV-III virus particles 
were purified by banding of ultra-filtrate concentrate in a 20% to 60% sucrose gradient in a Beckman SW27 rotor for 
16 hr at 27,000 rpm using a Beckman L8-M ultracentrifuge. Sucrose fractions between 30% and 40% were pooled, diluted 
with TNE (10 mM Tris pH 7.5, 0.1 M NaCl, 1 mM EDTA) and pelleted by differential ultracentrifugation in a Beckman Type 
50 rotor. Virus particles were resuspended in TNE at a concentration approximately 2000 times greater than that of 
the original medium. These concentrates contained 2.5 to 3.0 mg of protein per ml. 

Total RNA was extracted from HTLV-IIIB/H9 cells (Berger, et al., "Biochemistry" 18:5143 (1979)). PolyA containing 
RNA was purified by fractionation on oligo dT cellulose (Aviv, et al., "P.N.A.S. U.S.A." 74:5463 (1977)) and used for 
cDNA synthesis. 

B. Cloning of the p-24 gene 

DNA Synthesis 

PolyA RNA was primed with oligo dT for reverse transcriptase mediated cDNA synthesis (Capon, et al., "Nature" 
304:507 (1983)). Second strand DNA synthesis by DNA polymerase I was self-primed. The resulting double stranded 
DNA was treated with S1 nuclease to nick the resultant loop and to remove single stranded nucleotides. This blunt 
ended double stranded DNA was used to construct an HTLV-III phage library. 

Phage Library Construction and Detection of HTLV-III Clones 

The above double stranded DNA was blunt end ligated with T-4 ligase to R-1 linkers (37) which were synthesized 
by known techniques (35). This R-1/cDNA was ligated into Eco R1 digested λgt10 to generate an HTLV-III cDNA 
library as disclosed (37). 

Two copies of the phage library were replica plated onto nitrocellulose filters where the DNA was fixed for in situ 
hybridization (32). One set of filters was hybridized with 32P labelled cDNA prepared from the polyA positive RNA 
obtained from HTLV-III infected T-cells which was used to generate the cDNA library. The other set of filters was 
hybridized with 32P labelled cDNA obtained from the same cell line which was not infected with HTLV-III. Differential 
phage hybridization, analogous to the method employed for detecting bacterial cDNA clones (38), allowed applicants 
to identify a number of HTLV-III positive clones. 



EP 0187 041 B1 



Subcloning and Sequencing of HTLV-III Clones 

The above-identified HTLV-III clones from the cDNA phage library were digested with Eco R1. The cDNA inserts 
were isolated and subcloned into the R1 site of pBR322 (ATCC 37017) and propagated in E. coli 294 (ATCC 31446). 
A number of these subcloned R1 inserts were sequenced by the dideoxy method (39), producing the proviral 
sequence shown in Figs. 2a-2d. 

Amino Terminal Sequencing 

Live HTLV-III virus particles were purified as previously described. The constituent proteins were fractionated on 
a 12% polyacrylamide gel using Laemmli buffer (42). Approximately 600 µg of virus protein was loaded per lane under 
reducing conditions (41). The gel was stained with Coomassie blue and a protein band having a corresponding molecular 
weight of 24,000 daltons was cut from the gel. The protein was electroeluted from the gel (41) for amino terminal 
sequence analysis (42). The amino acid sequence for the first 17 amino acids, excluding residue 16, is shown in Figure 
1. The corresponding DNA sequence is shown directly below the amino acid sequence. A comparison of this DNA 
sequence with that obtained for the HTLV-III genome in Figure 2 revealed one particular clone, designated p 7.11 in 
Figure 3, which contained a DNA sequence encoding this peptide sequence. This approximately 2.2 kilobase insert covers 
the precursor gag region and encodes, 5' to 3', p-12, p-15, p-24 and a second p-15 protein, and includes approximately 
300 extra base pairs 3' to the gag region. The DNA sequence and corresponding amino acid sequence of this precursor gag 
region is shown in Figure 2. A partial restriction map of the 220 amino acid p-24 core protein region including an R-1 
site 3' from the expected carboxy terminal amino acid is depicted for p-7.11 in Figure 3. 

Subcloning of p 7.11 

Plasmid 7.11 was digested with Eco R1. The 2.2 kilobase insert was isolated and double digested with Rsa I and 
Pst I. The Rsa I-Pst I and Pst I-Eco R1 sequences were isolated. Plasmid UC9 (43) was digested with Sma I and Eco R1. 
The large fragment comprising most of pUC9 was isolated and hybridized with the Rsa I-Pst I and Pst I-Eco R1 fragments. 
The blunt Rsa I and Sma I ends were ligated with T-4 ligase. The resulting plasmid, designated pUCp-24, was 
used to transform E. coli 294. 

In constructing pUCp-24 the Rsa I site of p-7.11 was used. In so doing, the first ten N-terminal amino acids were 
excluded from this construction. To incorporate these ten amino acids a synthetic oligonucleotide was constructed (35). 
The DNA sequence of this 39-mer is 

5' p-GATCCCACCTATAGTGCAGAACATCCAGGGGCAAATGGT 3' 
         GGTGGATATCACGTCTTGTAGGTCCCCGTTTACCA-p 

The 5' end of this oligonucleotide contains a synthetic Bam HI site for subsequent ligation. Both strands were 
phosphorylated with polynucleotide kinase. 
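
The pairing of the two strands of the synthetic oligonucleotide can be checked mechanically. The following Python sketch is offered as an illustration only; the function names are hypothetical. It takes the 39-base upper strand (written 5' to 3') and the 35-base lower strand as printed above (read left to right, 3' to 5'), and confirms that the lower strand is the base-for-base complement of the upper strand once the unpaired 5' bases are set aside.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def complement(strand):
    """Return the base-by-base complement, without reversing the strand."""
    return "".join(COMPLEMENT[base] for base in strand)


def check_duplex(top_5to3, bottom_3to5):
    """Return (5' overhang of the top strand, True if the rest is fully paired)."""
    overhang_length = len(top_5to3) - len(bottom_3to5)
    overhang = top_5to3[:overhang_length]
    paired_region = top_5to3[overhang_length:]
    return overhang, complement(paired_region) == bottom_3to5


TOP = "GATCCCACCTATAGTGCAGAACATCCAGGGGCAAATGGT"   # 39-mer, 5' to 3'
BOTTOM = "GGTGGATATCACGTCTTGTAGGTCCCCGTTTACCA"    # as printed, 3' to 5'

if __name__ == "__main__":
    print(len(TOP), len(BOTTOM))
    print(check_duplex(TOP, BOTTOM))
```

The four unpaired bases, GATC, correspond to the cohesive end left by Bam HI cleavage (G^GATCC), while the opposite end of the duplex is blunt.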
Plasmid pUC-24 was digested with Bam HI and Sph I. The large fragment containing most of the pUC plasmid and 
the 3' end of the p-24 DNA sequence was isolated. Plasmid 7.11 was digested with Rsa I and Sph I. The fragment 
containing p-24 DNA sequence 5' to the Sph I site was isolated. This fragment and the synthetic 39-mer were hybridized 
to the Bam HI-Sph I fragment obtained from pUC-24. The blunt ends from the 39-mer and Rsa I-Sph I fragment were 
ligated with T-4 ligase. This plasmid, designated pUCp24L, was used to transform E. coli 294. 

C. Expression Vectors for P-24 Core Protein and Truncated P-24 Core Protein 

The construction of a p-24 and a truncated p-24 expression vector are depicted in Figure 4. 

Construction of pHGH-SRC-fsl-2 

Plasmid pSRCex16 (44) contains pBR322 and a fusion of the first 69 nucleotides of HGH with the DNA encoding 
the 60,000 MW phosphoprotein of SRC, the fusion being under the control of the E. coli trp promoter. It is not important 
that the plasmid encode any protein. Instead, any pBR322 plasmid derivative which contains a trp promoter is 
satisfactory (45). 

Plasmid pSRCex16 was partially digested with EcoRI and filled in using the Klenow fragment of DNA polymerase I in 
order to destroy the EcoRI site in the trp promoter, thereby leaving an EcoRI site at the beginning of the HGH fragment. 
The resulting plasmid, pEARI5RCex16, was then digested with EcoRI in order to open the plasmid, trimmed with S1 
nuclease to blunt the ends and ligated to a linker (fsl-2) (46) having the following structure: 

5' pAATTATGGAATTCCAT 3' 
    TTAATACGTTAAGGTAp 

Ligation with the fsl-2 linker recreates the EcoRI site immediately after the ATG, which is underlined above, at the 
EcoRI cleavage point indicated by an arrow. Accordingly, the first base of the first codon of p24/p-15, as constructed 
below, is provided by the vector. This plasmid is designated pHGH-SRC-fsl-2. 

Construction of a Full Length p-24 Expression Vector 

Plasmid UCp24L was cleaved with Bam HI. The cleaved fragment was treated with the Klenow fragment of DNA 
polymerase I in the presence of 0.5 mM nucleotide triphosphates to fill in the Bam HI restriction site. This fragment was 
then digested with EcoRI. The fragment containing the p-24 and p-15 (p-24/p-15) DNA sequence was isolated. 

Plasmid HGH-SRC-fsl-2 was cleaved with EcoRI and filled in as previously described. The cleaved plasmid was 
subsequently digested with Bam HI. The fragment containing the trp promoter and ATG initiation codon was isolated. 
This fragment was hybridized with a fragment conferring tetracycline resistance, derived by digesting pBR322 
with Bam HI and EcoRI, and with the blunt ended Bam HI-EcoRI fragment obtained from pUCp24L. The blunt 
ends were ligated with T-4 ligase. This plasmid, designated p24DE, was used to transform E. coli 294. Transformants 
were selected by their ability to resist tetracycline. Expression of this plasmid was expected to produce a polypeptide 
consisting of the full length p-24 core protein and the p-15 protein. 

Construction of a p-24 Fragment Expression Vector 

Plasmid p-24DE contains a Hind III restriction site located in the vicinity of the DNA sequence encoding amino 
acids 178-179 of the p-24 core protein. In addition, p-24DE contains a Hind III site in the tetracycline resistance 
gene derived from pBR322 located 3' to the p-24 DNA sequence. Digestion of p-24DE with Hind III followed by removal 
of the fragment encoding amino acids 179-200 and the p-15 protein and the religation of the fragment containing the 
5' p-24 DNA sequence under control of the trp promoter produced the plasmid designated p-DE-24AHD3. A TAA stop 
codon within the pBR322 sequence 3' to the truncated p-24 DNA sequence results in the expression of the first 178 
amino acids of p-24 core protein and eight amino acid residues encoded by the pBR322 DNA sequence. 

Example 2 

This example discloses the construction and cloning of an expression vector of p-24 core protein sequence of 
HTLV-III as a fusion polypeptide in a prokaryotic host. 

A. Expression Vectors for Fusion Polypeptides of p-24 Core Protein and Truncated p-24 Core Protein 

The construction of vectors for the expression of composite polypeptides for the p-24 and a fragment of p-24 is 
depicted in Figure 5. 

Construction of a p-24 fusion polypeptide expression vector 

Plasmid UCp24L was cleaved with Bam HI, blunt ended and digested with Eco RI as described above. The fragment 
containing the p-24/p-15 DNA sequences was isolated. 

Plasmid HGH 207-1 (45) was digested with Pst I and Pvu II. The sequence encoding HGH 3' from the Pst I site to 
the Pvu II site was isolated. This sequence excludes DNA sequences 5' from the Pst I site as well as the last 8 amino 
acid residues of mature HGH. These two fragments were hybridized to a Pst I-Eco RI fragment derived from pBR322 
which contains a gene conferring tetracycline resistance. T-4 ligase was used to ligate the Pvu II and blunt ended Bam HI 
site prior to transformation of E. coli 294 with this plasmid, designated pHGH p-24-A. 

Plasmid HGHp24-A does not encode the mature HGH amino acid sequence 5' to the Pvu II site of pHGH 3'R. A 
DNA sequence encoding such residues was inserted into the plasmid by the following method. 

Plasmid HGH 3'R was digested with Bgl II and Cla I. The fragment containing the tac promoter and HGH 5' DNA 
sequences encoding mature HGH to the Bgl II site was isolated. 

Plasmid HGH 207-1 was digested with Bgl II and Pst I. The Bgl II-Pst I fragment containing part of HGH and part of 
p-24 was isolated. The same plasmid was digested with Pst I and Cla I followed by isolation of the fragment containing 



p-15 and part of p-24. 

These three isolated fragments were ligated to produce pHGHp24-B. This plasmid contains DNA sequences under 
control of the tac promoter for expression of mature HGH, except for the C-terminal 8 amino acids of HGH, fused to 
p-24 core protein and p-15 protein. 

Construction of a Truncated p-24 Fusion Polypeptide Expression Vector 

Digestion of pHGHp-24B with Hind III followed by religation results in the same deletion as that obtained for p24DE. 
The resulting sequence, however, was expected to express a fusion polypeptide of mature HGH (absent 9 C-terminal 
amino acids) fused to the first 178 amino acids of p-24 and eight amino acid residues from the expression plasmid 3' 
to the p-24 DNA sequence. This plasmid is designated pHGHp-24AHD3. 

Example 2a 

This Example describes the isolation of cDNA encoding the gp41 envelope protein, its tailoring for expression in 
prokaryotes, expression of a gp41 fusion in E. coli and the in vitro resolubilization of the expressed fusion. 

Viral mRNA was isolated, cDNA prepared from the viral mRNA, and the cDNA cloned into λgt10 phage as 
described in Example 1 A and B above. A cDNA clone, H9c.53, was identified that comprised cDNA encoding the 
complete gp41 envelope region (bp 7336 to 8992, Fig. 2d). H9c.53 was digested with Rsa I and Hind III and the fragment 
(Fragment I) corresponding to nucleotides 7417-7722 of the proviral genome was isolated. Fragment I encodes amino acid 
residues 29-130 of gp41 (the N-terminal residues 
being ValGlnAlaArg and the C-terminal residues being AsnTyrThrSer). Fragment I was ligated to an excess of synthetic 
EcoRI adaptor (having the sequence AATTCATGTCTTACGGTCAAGG) with a phosphorylated blunt end and a 5' hydroxyl 
EcoRI end as described above in Example 1B, digested with Hind III in order to cleave ligation-formed dimers, 
and the resulting 326 bp EcoRI-Hind III fragment isolated on a 6% polyacrylamide gel. Plasmid pNCV (55a) was digested 
with EcoRI and Hind III and the vector fragment recovered. The 326 bp, gp41-encoding fragment was ligated to the 
pNCV fragment and the ligation mixture used to transform E. coli (ATCC 31446) to tetracycline resistance. A tetracycline 
resistant plasmid was recovered (ptrpLEgp41). This plasmid is shown in Fig. 8A. The open reading frame under the 
control of the trp promoter encodes a gp41 fusion having at its amino terminus 190 residues representing the combined 
sequence of the amino terminus of the E. coli trp E gene product and its leader together with the amino acid sequence 
encoded by the Eco RI adaptor, and at its C-terminus 14 residues encoded by residual pBR322 sequence. The fusion 
contained 306 residues in total. 

While the gp41 fusion represents a combined amino and carboxyl fusion of prokaryotic peptides with a gp41 fragment, 
it will be appreciated that either or both of the prokaryotic sequences may be deleted or substituted for by other 
prokaryotic, eukaryotic or synthetic polypeptides. When direct expression of the DNA encoding the gp41 fragment was 
attempted, rather than as an amino terminal fusion, immunoreactive material could not be identified in the cell culture 
extracts. The reasons for this are unknown, but may involve instability of the unfused product in E. coli. Accordingly 
gp41 or its fragments should be expressed in prokaryotes as N-terminal fusions. Such fusions of course will include 
fusions with the secretory leaders of E. coli periplasmic or extracellular proteins, e.g. the ST-II heat stable enterotoxin 
or alkaline phosphatase signals, among others. In this case gp41 will be secreted as a mature protein free of prokaryotic 
sequences. Plasmids encoding fusions of signal peptides with the mature gp41 fragment are readily constructed by 
inserting synthetic DNA encoding the known sequences of prokaryotic signals in place of the trp LE-adaptor sequence 
in ptrpLEgp41 using appropriate flanking restriction sites and adaptors or linkers as required. Any remaining extraneous 
DNA is excised using M13 phage deletional mutagenesis or other conventional method. 

The gp41 DNA was truncated to the 102 residue fragment so as to avoid including sequences near the amino and 
carboxyl terminus that are particularly rich in hydrophobic residues, specifically the two long hydrophobic stretches of 
28 and 22 amino acids contained within the putative transmembrane region of gp41. Hydrophobic regions are known 
in some instances to be deleterious to E. coli growth (56, 57). The region extending from residues 29-130 relative to 
the amino terminus of gp41 did not contain these hydrophobic regions. Note, however, that other host cells such as 
mammalian cells are suitable for expression of the intact gp41 protein or gp41 fragments containing one or both terminal 
hydrophobic regions. 

E. coli (ATCC 31446) was transformed with ptrpLEgp41 and grown in M9 minimal media until values at A^q of 
6.6, 9, 12, 17 and 21 were reached. Total cellular proteins were electrophoretically separated on 10% SDS polyacrylamide 
gel and stained with Coomassie blue. The results are shown in Fig. 8b, wherein lanes a-e correspond to the 
above densities in ascending order, while lane f depicts total cellular protein from a pBR322-transformed control (k 
represents 1,000 daltons). E. coli transformants accumulated a prominent protein (LEgp41, at about 10% by weight 
of the total intracellular bacterial protein) with an apparent molecular weight of 33,000 daltons, consistent with the 
predicted size of 306 amino acids. Recombinant LEgp41 was purified from refractile bodies enriched for the protein 
by gel filtration and ion-exchange chromatography using procedures similar to those previously described for trpLE- 
foot and mouth disease virus polypeptide fusions (55a). Specifically, cells were washed in 10 mM Tris-HCl, 1 mM EDTA, 
resuspended in 4 volumes of the same buffer, sonicated, and the refractile bodies collected by centrifugation at 5,000 
rpm for 30 minutes in a Sorvall GSA rotor. The particles were washed twice and solubilized with 7M guanidine HCl, 
0.1% β-mercaptoethanol (BME). The resulting extract was clarified at 19,000 rpm for 4 hours in a Beckman TOO rotor. 
The high speed supernatant was passed through a Sephacryl S-300 (Pharmacia) column. Fractions from the included 
volume containing LEgp41 were identified by electrophoresis on SDS-polyacrylamide gels, pooled and dialysed 
extensively against 7.5M urea, 0.1% BME. This pool was chromatographed on DEAE-52 cellulose (Whatman) in 7.5M 
urea, 0.1% BME, 10 mM Tris-HCl pH 8.5. LEgp41 material that flowed through the column was pooled and dialysed 
into 4M urea, 10 mM Tris-HCl pH 8.5 at a concentration of 0.2 mg/ml. The purified LEgp41 appeared greater than 95% 
homogeneous on SDS-polyacrylamide gels by silver staining. LEgp41 was not glycosylated. 
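
The stated size of the fusion can be reconciled with the figures given in this Example by simple arithmetic, sketched below in Python. The function names are hypothetical, and the average residue mass of 110 daltons is a rule-of-thumb assumption introduced here for illustration; it is not a value taken from this specification.

```python
AVERAGE_RESIDUE_MASS_DA = 110  # assumed rule-of-thumb mass per amino acid


def residues_encoded(first_nt, last_nt):
    """Number of codons in an inclusive nucleotide range read in frame."""
    length = last_nt - first_nt + 1
    if length % 3 != 0:
        raise ValueError("range is not a whole number of codons")
    return length // 3


def fusion_length(n_terminal, insert, c_terminal):
    """Total residues in an N-terminal leader + insert + C-terminal tail fusion."""
    return n_terminal + insert + c_terminal


if __name__ == "__main__":
    gp41_part = residues_encoded(7417, 7722)        # Fragment I of gp41
    total = fusion_length(190, gp41_part, 14)       # trp LE/adaptor + gp41 + pBR322
    print(gp41_part, total, total * AVERAGE_RESIDUE_MASS_DA)
```

Under that assumption the 306-residue fusion would have a mass in the region of 34,000 daltons, which is in line with the apparent molecular weight of 33,000 daltons observed on the gel.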

This purification procedure eliminated the need to preadsorb patient sera with E. coli proteins where the sera are 
destined for assay of their anti-AIDS antibody level using the LEgp41 fusion; apparently there are little or no pre-existing 
antibodies to the LE portion in patient sera, at least no antibodies in the conformation in which they exist in LEgp41. 
This also suggests that immunization programs using fusions of bacterial proteins such as LE with HTLV-III sequences 
present a low risk of inducing a hyperimmune response, notwithstanding the fact that humans are commonly exposed 
to such proteins. Prokaryotic protein fusions for use in vaccines are identified by screening pooled sera from normal 
individuals for the presence of antibodies that cross-react with the candidate fusion. Conventional methods such as 
Western Blotting or radioimmunoassay are used for the screening, for example as shown below in Example 2b. 

20 Example 2b . 

Enzyme-linked immunosorbent assay (ELISA) for the detection of antibodies to LEqp41 

Purified LEgp41 (Example 2a) was coated in 96-well microtiter plates in 0.05 M sodium carbonate pH 9.6 as
described (58). Sera were diluted 1:100 in TBS and incubated in coated microtiter wells for 1 hour at room
temperature. Following 3 washes with TBS the plates were incubated for two hours at room temperature with antibody-
enzyme conjugate (mouse monoclonal anti-human IgG Fc-horseradish peroxidase, Travenol-Genentech Diagnostics).
Plates were again washed with TBS, developed with 3,3',5,5'-tetramethylbenzidine, and color development measured
in a plate reader (Dynatech) at 590 nm. Samples with absorbance values greater than twice the mean of negative
control sera were scored as positive.
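
The scoring rule just stated (positive when absorbance exceeds twice the mean of the negative control sera) can be sketched as follows. This is a minimal illustration only: the function name, the sample names and the absorbance readings are hypothetical and are not data from this example.

```python
from statistics import mean


def score_sera(sample_od, negative_control_od, factor=2.0):
    """Score samples as positive when absorbance exceeds factor x mean negative control.

    sample_od: dict mapping a sample name to its absorbance at 590 nm.
    negative_control_od: list of absorbances for negative control sera.
    Returns the cutoff and a dict of sample name -> True (positive) / False.
    """
    cutoff = factor * mean(negative_control_od)
    calls = {name: od > cutoff for name, od in sample_od.items()}
    return cutoff, calls


# Hypothetical readings, for illustration only.
negatives = [0.05, 0.06, 0.07]
samples = {"serum_A": 0.21, "serum_B": 0.08}
cutoff, calls = score_sera(samples, negatives)
print(cutoff, calls)
```

With these assumed controls the cutoff works out to about 0.12, so the first hypothetical serum is scored positive and the second negative.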

Significant antibody levels to purified LEgp41 were detected in sera from 125 of 127 (98%) clinically diagnosed
AIDS patients. By comparison, none of 300 sera from random blood donors previously shown to be seronegative for
HTLV-III using a commercial whole virus ELISA test kit were found to react with LEgp41 above the cutoff absorbance
(OD590>0.12). The two AIDS patient sera which did not react with LEgp41 were the only AIDS sera nonreactive by
commercial whole virus ELISA. One of these sera was found to react with whole virus on Western Blots but this reactivity
could be attributable to p24 core antigen. The other LEgp41 negative AIDS serum did not react with virus on Western
Blots, although virus could be cultured from this patient. The 125 positive AIDS sera reacted strongly with LEgp41,
giving values ranging from 0.21 to 1.64 OD590 units, with only 3 sera giving OD590 values of <0.4 units. Thus,
recombinant LEgp41 shows excellent specificity and sensitivity in detecting antibodies directed against the AIDS
retrovirus.
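
The sensitivity and specificity implied by the counts reported above can be checked with simple arithmetic. The sketch below uses only the counts given in the text (125 of 127 AIDS sera reactive, 0 of 300 seronegative donor sera reactive); the function name is an illustrative choice.

```python
def proportion(numerator, denominator):
    """Return numerator / denominator as a fraction between 0 and 1."""
    return numerator / denominator


# Counts taken from the text above.
sensitivity = proportion(125, 127)      # reactive AIDS sera / all AIDS sera tested
specificity = proportion(300 - 0, 300)  # nonreactive donor sera / all donor sera tested

print(f"sensitivity = {sensitivity:.1%}")  # prints "sensitivity = 98.4%"
print(f"specificity = {specificity:.1%}")  # prints "specificity = 100.0%"
```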

The purified LEgp41 ELISA similarly detected antibodies found in sera from ARC (AIDS-related complex) patients
and healthy homosexual men. Sixty of 69 (86%) ARC patient sera reacted with LEgp41, yielding essentially identical
results as the commercial whole virus ELISA. Of the 9 nonreactive ARC samples, Western blot analysis with whole
virus revealed that two of the sera were positive for p24 only, while the remaining seven sera were completely
nonreactive although virus could be recovered from four of these patients.

As expected, a small fraction of sera obtained from asymptomatic homosexual men (26/75, 34%) was reactive in
the LEgp41 ELISA. The 26 LEgp41 ELISA-positive sera from this cohort included all 24 positive sera detected by the
commercial whole virus ELISA and were confirmed positive by Western Blots utilizing whole virus. Significantly, two
of the LEgp41 seropositive samples from healthy gay men were found to be repeatably nonreactive in the commercial
whole virus ELISA.

There was no evidence of nonspecific reactivity in the LEgp41 ELISA with sera from six patients with diseases
that characteristically give false-positive responses in other serological tests (systemic lupus erythematosus,
rheumatoid arthritis, heterophile positive mononucleosis or Goodpasture's syndrome). As an additional test of specificity,
sera previously giving false-positive reactions in a commercial whole virus ELISA were examined by the LEgp41 ELISA.
A group of 58 sera which had given nonrepeatable positive results with the commercial whole virus ELISA were found
not to react in the LEgp41 ELISA. Furthermore, 25 sera which gave repeatable positive results in the commercial ELISA
test, but which were negative in more accurate Western Blot assays, also did not react in the LEgp41 ELISA.
Example 3

Immunoassay of expressed recombinant polypeptide 

A. Expression of full length and truncated p-24 and p-24 fusion polypeptide

E. coli 294 containing each of the Example 2 plasmids is first grown in L broth which is rich in tryptophan. This
medium represses the trp promoter. Expression of recombinant polypeptide is induced by transferring the culture to
M-9 medium, which does not contain tryptophan, after the L-broth culture has attained an O.D. of approximately 0.5
at 550 nm. After about one hour, indole acetic acid is added to the medium to produce a final concentration of about
10 µg/ml to further induce expression. After an additional 5-6 hours at 37°C the cells are collected by centrifugation,
lysed (47) and subjected to Western blot analysis.

B. Immunological Assay of Expressed Polypeptides 

Rabbit Anti-HTLV-III

The polypeptides expressed by the above-identified plasmids were assayed for immunological reactivity with serum
from rabbits inoculated with Triton treated HTLV-III retrovirus and which were subsequently boosted with live HTLV-III
virus.

A Western blot of p24DE and p24DEAHD3 polypeptides and pHGHp24B and pHGHp24AHD3 fusion polypeptides
expressed in M-9 medium supplemented with indole acetic acid and reacted with HTLV-III rabbit antiserum and
125I-labelled Protein A is shown in Figure 6. A number of immunologically reactive bands for the p24DE polypeptide are
apparent in lane 3. The band corresponding to a molecular weight of about 44,000 daltons corresponds to an expressed
protein comprising p-24 and p-15. Surprisingly, two doublets with molecular weights of approximately 24,000 and
15,000 were also observed. The cause of these doublet signals is not presently understood. However, the detection
of these doublet signals with molecular weights which are consistent with the gag proteins produced naturally from the
polycistronic gag region indicates that this strain of E. coli is capable of processing such precursor polypeptide sequences
to produce HTLV-III proteins which may be detected by complementary antibody. A Western blot of p-24DEAHD3
polypeptide is also shown in lane 4 of Figure 6. Again, a doublet having a molecular weight of about 20,000 daltons is
observed. The absence of a 44,000 dalton p-24/p-15 full length HTLV-III protein and the 15,000 dalton doublet detected
for p24DE is consistent with the above interpretation of the p24DE Western blot.

Lanes 5 and 6 of Figure 6 contain respectively the immunologically reactive proteins expressed by pHGHp24B
and pHGHp24AHD3. The band in lane 5 having a molecular weight of approximately 65,000 daltons is consistent with
the expected composite polypeptide constructed. Further, the band in lane 6 has the expected molecular weight of
about 45,000.

Assay with Human AIDS Serum 

The polypeptide expressed by plasmid p24DE was assayed for immunological reactivity with serum from an
individual afflicted with AIDS. In Figure 7, strip 3 depicts the Western blot obtained from p24DE polypeptide treated with
human AIDS serum and 125I-labelled Protein A. A diffuse band having a molecular weight of approximately 24,000
daltons can be seen. Strip 2 is a control which contains p-24DE polypeptide treated with normal human serum and
125I-labelled Protein A. Strip 1 contains size markers. This assay indicates that polypeptides of the present invention can
be used to detect complementary antibody made in response to exposure to or infection by an AIDS associated
retrovirus.

Vaccine 

The vaccines contemplated by the present invention comprise the predetermined polypeptides and fusion
polypeptides previously described. The following examples disclose the use of vaccines containing p-24 polypeptide
sequences from HTLV-III. However, any polypeptide sequence of an AIDS associated retrovirus, especially those
encoded by the env region of the retrovirus genome, may be used.

Example 4

Immunization of Mice 

The polypeptides encoded by p-24DE and p-24DEAHD3 are used to immunize BALB/c mice. Each mouse is
immunized with 5-25 µg of p-24DE or p-24DEAHD3 polypeptide contained in 200 µl of an emulsion consisting of 50%
aqueous antigen and 50% complete Freund's adjuvant. Each mouse is immunized at multiple intradermal and
subcutaneous sites as follows: 25 µl in each rear foot pad, 50 µl in the tail and 100 µl distributed among 3-5 intradermal sites
along the back. A control group is similarly injected with emulsion which does not contain antigen. Four weeks after
primary immunization the mice are boosted with 5-25 µg of polypeptides as above with the exception that the emulsion
is prepared with incomplete Freund's adjuvant. For the booster immunization each mouse receives 200 µl of the
antigen emulsion or emulsion lacking antigen distributed as follows: 50 µl in the tail and 50 µl distributed among
intradermal sites along the back. Three weeks after boosting approximately 500 µl of blood is collected from each mouse
by tail bleeding. The sera obtained from this blood are used for in vitro neutralization studies.


In Vitro Neutralization Studies 

Sera from mice immunized with p-24DE or p-24DEAHD3 polypeptide and from mice in the control group are tested
for the ability to neutralize HTLV-III in vitro. 25 µl of serially diluted mouse serum (2-fold dilution: 1:8 to 1:16384) is
incubated with 175 µl of HTLV-III concentrate for one hour at 37°C in Dulbecco's modified Eagle medium. After incubation,
each dilution is applied to approximately 40,000 HUT-78 T-cells (48) contained in each well of a 96 well tissue culture
plate. After 3-4 days incubation, virus growth is determined by an immunofluorescence assay for HTLV-III (49) or by a
reverse transcriptase assay. The absence of HTLV-III infection of HUT-78 T-cells indicates that the polypeptide induces
the production of neutralizing antibodies to HTLV-III in mice.
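
The two-fold dilution series named above (1:8 to 1:16384) can be enumerated with a short sketch. The function name is an illustrative choice; the additional dilution on mixing is simple arithmetic from the stated volumes (25 µl of serum in a 200 µl mixture) and is not a figure given in the text.

```python
def twofold_series(first, last):
    """Return the reciprocal dilutions of a 2-fold series from 1:first to 1:last inclusive."""
    dilutions = []
    d = first
    while d <= last:
        dilutions.append(d)
        d *= 2
    return dilutions


series = twofold_series(8, 16384)
print(len(series), series)

# Mixing 25 µl of diluted serum with 175 µl of virus concentrate dilutes the serum further.
mix_factor = (25 + 175) / 25
print(mix_factor)
```

The series has twelve steps, which would fit across one row of a 96 well plate.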


Primate Assay for Neutralizing Antibodies 

The polypeptides which raise neutralizing antibodies in mice are used to immunize chimpanzees which are known
to develop generalized lymphadenopathy when infected by HTLV-III retrovirus. Each chimpanzee in an experimental
group is immunized with 50-100 µg of polypeptide contained in a 200 µl emulsion consisting of 50% aqueous antigen
and 50% complete Freund's adjuvant. Each chimpanzee is immunized at multiple intradermal sites. A control
group is injected with emulsion not containing antigen. After four weeks the chimpanzees in each group are boosted
with 200 µl of the emulsion or antigen emulsion as above except that the emulsion is prepared with incomplete Freund's
adjuvant.

Five weeks after boosting, the chimpanzees in the experimental and control group are challenged with various
doses of HTLV-III concentrate. Those polypeptides which prevent the development of lymphadenopathy in the
experimental group of chimpanzees are candidates for a vaccine which is capable of inducing the production of neutralizing
antibodies in humans which resist infection by AIDS associated retrovirus.

Because complete Freund's adjuvant is not acceptable for use in humans, the above identified polypeptides which
prevent lymphadenopathy in chimpanzees challenged by HTLV-III are formulated with an adjuvant suitable for human
use. Such formulation may comprise alum precipitated polypeptide complexes (22) which are used to immunize
chimpanzees in an experimental group. A control group is vaccinated with adjuvant alone. Formulations which prevent
lymphadenopathy in the experimental group comprise a polypeptide in admixture with a pharmaceutically acceptable
vehicle which may be used as a human vaccine.


Example 5 

The composite polypeptides encoded by pHGHp24B and pHGHp24AHD3 are used in a manner analogous to
that disclosed in Example 4 to determine which composite polypeptide confers resistance to AIDS infection in
chimpanzees as evidenced by the absence of the development of lymphadenopathy in immunized chimpanzees challenged
with HTLV-III retrovirus. The use of HGH polypeptide sequences in such composite polypeptides, however, is not
preferred for a human vaccine against infection by AIDS associated retrovirus since such composite polypeptides may
induce an autoimmune response against HGH in human subjects. Accordingly, a composite polypeptide vaccine to
resist infection by an AIDS associated retrovirus in humans should consist of a predetermined sequence of an AIDS
associated retrovirus or fragment thereof expressed as a fusion polypeptide with a secondary polypeptide sequence
which is not capable of inducing a substantial autoimmune response to polypeptides naturally occurring in humans.
AIDS Associated RNA Dependent DNA Polymerase

The polypeptide sequence of an AIDS associated RNA dependent DNA polymerase (AIDS reverse transcriptase)
is contemplated to be used in an assay to identify compounds which inhibit such transcriptase activity and which may
be used as a pharmaceutical agent to inhibit infection by AIDS associated retrovirus or dissemination of such retrovirus
in infected individuals. Suramin, a known inhibitor of AIDS reverse transcriptase (53), is disclosed in the following
example. However, the assay of other compounds for transcriptase inhibition is contemplated because of the severe
clinical side effects that Suramin elicits when administered to humans (53). Examples of such selective inhibitors have
been disclosed by several laboratories (54).


Example 6 

A. Cloning and Expression of AIDS Reverse Transcriptase 

The region of the viral genome encoding the AIDS associated retrovirus reverse transcriptase of HTLV-III terminates
at nucleotide 4674. Its N-terminus is located within the region extending from nucleotide 1639 to about nucleotide
1800. For the purposes described below it is preferred to select an amino terminus that is located within the gag region
(1639-1769), thereby encompassing all optional amino termini. The region located proximate to nucleotide 1800
includes a number of suitable met codons which are useful as the start codons for constructions expressing the reverse
transcriptase. However, it is not critical that in situ met codons be selected. Instead, an exogenous, vector-borne ATG
codon is ligated to partial exonuclease digests or M13 deletion mutants of the 5' region of the reverse transcriptase
gene. Constructions that are properly in-frame are easily identified by the ability of transformant cell extracts to reverse
transcribe RNA. Alternatively, the normal in vivo N-terminus for the reverse transcriptase as processed in a given host
is determined by purifying the enzyme from infected cells, sequencing the amino terminus in accord with methods
known per se and identifying the cDNA nucleotide sequence encoding the amino acid sequence of the amino terminus.
Preferably, the cDNA of the figures is digested with BglII, which cleaves at nucleotide 1642, and SalI (5367). The 3725
bp fragment is recovered. N-terminal cleavage sites other than BglII are at 1738 (DdeI) and 1754 (AluI). However,
these enzymes also cleave at points internal to the reverse transcriptase gene. Thus, it would be necessary to conduct
a partial digest with DdeI or AluI in order to expect to recover a full length gene. Another cleavage site other than SalI
which is located distal to the reverse transcriptase gene is that of StuI (at 4987).
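
The stated fragment size follows directly from the cleavage coordinates given above. The sketch below is illustrative only: it treats each site as a single nucleotide coordinate, as the text does, and ignores the exact cut position within each recognition sequence. The function and variable names are assumptions.

```python
def fragment_length(upstream_site, downstream_site):
    """Length in bp of the fragment between two cleavage coordinates on the same sequence."""
    if downstream_site <= upstream_site:
        raise ValueError("downstream site must lie 3' of the upstream site")
    return downstream_site - upstream_site


# Coordinates as given in the text.
BGLII, SALI, STUI = 1642, 5367, 4987

bgl_sal = fragment_length(BGLII, SALI)  # the 3725 bp fragment named in the text
bgl_stu = fragment_length(BGLII, STUI)  # same convention applied to the alternative distal site
print(bgl_sal, bgl_stu)
```

Both fragments extend past nucleotide 4674, where the reverse transcriptase coding region is said to terminate.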

Having isolated DNA encoding the reverse transcriptase gene which bears terminal ligation sites, the DNA is ligated
into a replicable vector. The vector will be selected depending upon the intended host, and this in turn on whether the
DNA is to be simply replicated or whether it will be desired to use the vector for expression of the enzyme. Ordinarily,
the vector will contain a bacterial origin of replication and an antibiotic selection gene, e.g. for tetracycline resistance.
These elements are available in a large number of publicly known plasmids such as pBR322. Since it is preferred to
express the reverse transcriptase in higher eukaryotic cells, the vector will also contain elements necessary for the
stable replication of the gene, for identification of transformants, for promoting the transcription of the gene and for
properly terminating the transcribed mRNA in higher eukaryotes. Typically, these will be, respectively, the SV40 origin
of replication and a source of T antigen (such as supplied by chromosomal integrants found in such cell lines as
COS-1, available from Cold Spring Harbor Laboratories), a gene encoding mouse thymidine kinase, the SV40 early or late
promoters and the Hepatitis B surface antigen polyadenylation site. Such vectors are known, for example, see EPA
73,656; 92,182 and 93,619 all of which are incorporated by reference.

Known vectors may not contain convenient restriction sites for direct insertion of the reverse transcriptase DNA
obtained as described above. This will not constitute an obstacle to those skilled in the art since methods are known
per se for introducing new restriction sites or for converting cohesive-ended restriction termini into blunt ends. For
example, the hepatitis surface antigen gene in pHS94 of EPA 73,656 is excised by EcoRI and BamHI digestion, EcoRI
and BamHI linkers ligated to the BglII and SalI sites of the cDNA fragment, respectively, and the modified cDNA ligated
into the vector fragment obtained from the pHS94 digestion. Then the construction is completed in accord with EPA
73,656.

The vector bearing the reverse transcriptase gene is transfected into permissive hosts such as the COS-1 monkey
kidney cell lines or the CHO (Chinese hamster ovary) cell line. Other stable eukaryotic cell lines may be employed as
hosts, as well as yeast and prokaryotes where appropriate origins of replication and promoters are provided (for yeast,
the two micron origin and a promoter such as that of metallothionein, and for bacteria the pBR322 origin and a promoter
such as trp which is described elsewhere herein in connection with expression of the gag and envelope proteins of the
AIDS associated retrovirus in prokaryotes).

In this connection, other AIDS associated retroviral proteins such as gag and envelope polypeptides may be ex- 
pressed in eukaryotic cells, whether yeast or mammalian, as such cells bear a more immediate phylogenetic relation- 
ship to the normal retroviral hosts than do prokaryotes. 
The reverse transcriptase is preferably synthesized in recombinant culture under the control of an inducible
promoter so that any toxic effects of the polymerase on the cell are minimized until a generous amount of mRNA has
accumulated. Alternatively, the gene encoding the enzyme is ligated at its 5' end to a signal sequence recognized and
processed by the intended host, which in the case of higher eukaryotes, for example, includes the known secretion
signals for interferons, secreted hormones like insulin or viral surface antigens.

B. Assay of Reverse Transcriptase Inhibition 

A cell culture expressing AIDS reverse transcriptase is lysed and divided into aliquots. Various amounts of the
compound to be assayed are added to each aliquot of lysate. An oligonucleotide of polyA together with oligo dT primers
and 32P-labelled TTP is added to each aliquot. After incubation at 37°C, the amount of acid precipitated counts is
determined. Those compounds which inhibit AIDS reverse transcriptase but which do not comparatively inhibit human
DNA polymerase are then selected for further in vitro toxicity and efficacy studies.
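
The comparison described above can be expressed as percent inhibition computed from acid precipitated counts with and without the test compound. The sketch below is a hedged illustration: the function name, the optional background subtraction and all count values are hypothetical and do not come from this example.

```python
def percent_inhibition(counts_with_compound, counts_without_compound, background=0.0):
    """Percent reduction in acid precipitated counts caused by the test compound.

    background is an assumed blank (e.g. no-enzyme) count subtracted from both readings.
    """
    treated = counts_with_compound - background
    control = counts_without_compound - background
    if control <= 0:
        raise ValueError("control counts must exceed background")
    return 100.0 * (1.0 - treated / control)


# Hypothetical counts, for illustration only.
rt_inhibition = percent_inhibition(2000, 10000)   # reverse transcriptase lysate
pol_inhibition = percent_inhibition(7600, 8000)   # human DNA polymerase control
print(rt_inhibition, pol_inhibition)
```

A compound giving a high value for the reverse transcriptase and a low value for the human polymerase would be the kind selected for further study under the criterion stated above.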

Alternatively, AIDS reverse transcriptase is recovered from an expression host by methods known per se, including
immunoaffinity adsorption, Sephadex gel filtration, ion exchange chromatography and electrofocusing on native
polyacrylamide gels. The recovered reverse transcriptase is then used to assay compounds as described above.

Example 7

Cloning and Expression of the E' Polypeptide

An E' polypeptide of AIDS-associated retrovirus is defined as the 206 residue polypeptide designated "E'" in Fig.
2, its naturally-occurring alleles, or its amino acid sequence variants which are immunologically cross-reactive with
antisera capable of binding the E' polypeptide produced in cells infected with AIDS-associated retrovirus. In addition
to the C-terminal deletional derivatives described below, other deletions, insertions or substitutions that do not
substantially change the immunoreactivity of the peptide with sera from patients infected with HTLV-III or other
AIDS-associated retroviruses are included within the scope of the term "E' polypeptide" or "E' protein" as used herein. A
further example of a deletional variant is E' protein in which the first 19 residues, through Arg19, are deleted. The trpLE
fusion described below is an insertional variant. Other variants will be apparent to the ordinary artisan. They are readily
produced by methods of recombinant synthesis or, in the case of certain deletions, by proteolytic hydrolysis.

In accordance with this invention, E' polypeptides are produced by recombinant methods so as to be free of proteins
from AIDS-associated retrovirus-infected cells that are not encoded by the AIDS-associated retrovirus. Such
polypeptides obviously are not present in AIDS-associated retrovirus virions.

cDNA clone H9.c53 contained the E' sequence. The E' sequence is shown in Fig. 2 commencing at nucleotide
8375. The first stop codon in reading frame downstream from the ATG at 8375 is an OP stop codon at nucleotide 8993.
The Cys residue immediately preceding this stop codon presumably is the C-terminus of the E' polypeptide. However,
other C-termini upstream from this Cys residue also are within the scope of this invention. E' polypeptides include
sequences that are C-terminated at any residue within about the last 50 residues shown in Fig. 2, preferably immediately
adjacent and downstream from a lysinyl or argininyl residue.
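
The 206 residue length of the E' polypeptide is consistent with the coordinates given above, as the following sketch shows. It assumes, as the text implies, that 8375 is the first nucleotide of the ATG and 8993 the first nucleotide of the stop codon; the function name is an illustrative choice.

```python
def codons_between(start_nt, stop_nt):
    """Number of sense codons from the first nucleotide of the start codon
    up to (not including) the first nucleotide of the stop codon."""
    span = stop_nt - start_nt
    if span <= 0 or span % 3 != 0:
        raise ValueError("start and stop codons are not in the same reading frame")
    return span // 3


residues = codons_between(8375, 8993)
print(residues)  # prints 206
```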

An expression plasmid for the synthesis of a fusion of the E' polypeptide of Fig. 2 with a bacterial polypeptide is
described hereafter. One aliquot of H9.c53 was digested with HaeIII and XhoI and the 62bp fragment coding for E'
residues 14-34 recovered (fragment 1). Another aliquot was digested with XhoI and HindIII and the 719bp fragment
was recovered which encodes amino acids 35-206 and contains the stop codon followed by untranslated 3' sequence
(fragment 2).

A synthetic oligonucleotide (fragment 3) was prepared (35) having the following sequence encoding the first 13
residues of the E' polypeptide flanked at its 5' end by an EcoRI cohesive terminus and ending with a 3' blunt end for
ligation to the HaeIII-generated blunt end of fragment 1.

PAATTCATGGGTGGCAAGTGGTCAAAAAGT 
H(>^TACCCACCGTTCACCAGTTTTTC

pNCV (55a) was digested with EcoRI and HindIII, which are unique sites in pNCV, and the vector fragment
recovered. This vector fragment was ligated simultaneously to fragments 1, 2 and 3, the ligation mixture transformed into
E. coli 294 and antibiotic resistant colonies identified. ptrpLE-E' was obtained from a resistant colony by preparation
of plasmid DNA. The structure of this plasmid is shown in Fig. 9a.
ptrpLE-E' contains a continuous open reading frame encoding a bacterial (E. coli)-derived protein (LE) fused at its
C-terminus to the full E' sequence.

This plasmid was employed as follows to prepare the E' protein fusion (LE-E'). ptrpLE-E' was transformed into E.
coli strain 294, grown overnight in LB broth (32) containing 5 mcg/ml tetracycline, diluted 1:50 into M9 broth (32)
containing tetracycline and grown at 37°C to an absorbance of 0.5 at 550 nm. Total cell proteins were prepared from
25 ml of induced cell culture. Cells were resuspended in 1/250 volume of 0.01 M Tris-HCl pH 7.5, 0.001 M EDTA, 0.03
M mercaptoethanol and 0.8% SDS, boiled for 2 minutes and precipitated with 3 volumes of cold acetone. The
precipitated proteins were redissolved by boiling in SDS-polyacrylamide gel loading buffer (40).

Initial attempts to directly express unmodified E' sequences in E. coli were unsuccessful due to an apparent
instability of the protein. However, the foregoing method gave efficient expression of the E' polypeptide in E. coli. LE is a
protein of 190 residues derived from translated sequences of the trp leader and trpE gene product (55b), which forms
stable, insoluble aggregates in vivo and has been used successfully to stabilize synthesis of other foreign proteins in
E. coli (55c, 55a). Expression of the LE-E' fusion protein from ptrpLE-E' is under the control of an E. coli trp promoter
and trp leader ribosome binding site (Fig. 9a). Cultures of E. coli transformed with ptrpLE-E' were grown under
conditions of tryptophan depletion as described above to derepress the trp promoter, and their proteins analysed by
electrophoresis on 10% SDS-polyacrylamide gel and staining with Coomassie blue. As shown in Fig. 9b, cells containing
plasmid ptrpLE-E' accumulate a prominent protein of Mr 41,000 (lane b), which is not present in cultures of E. coli
transformed with pBR322 (lane c). The Mr 41,000 protein has two important characteristics of the expected LE-E'
fusion protein. First, this protein is seroreactive with antisera to LE, and second, it has an Mr of 22,000 greater than
LE (Fig. 9b, lane a) consistent with the size predicted for a protein containing 206 additional amino acids.
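
The agreement between an Mr increase of about 22,000 and 206 additional residues can be checked with a common rule of thumb of roughly 110 daltons per amino acid residue. That average is an assumption introduced here for illustration, not a value from this specification, and apparent Mr on SDS gels can differ from the calculated mass.

```python
AVERAGE_RESIDUE_MASS = 110  # daltons per residue; an assumed rule-of-thumb average


def approximate_mass(n_residues, residue_mass=AVERAGE_RESIDUE_MASS):
    """Rough polypeptide mass in daltons estimated from residue count alone."""
    return n_residues * residue_mass


e_prime_mass = approximate_mass(206)  # E' portion, 206 residues per the text
le_mass = approximate_mass(190)       # LE portion, 190 residues per the text
print(e_prime_mass, le_mass, e_prime_mass + le_mass)
```

On this estimate the E' portion contributes about 22,700 daltons, close to the 22,000 difference observed on the gel.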

To determine whether LE-E' exhibits antigenic sites recognized by human sera following exposure to the AIDS
retrovirus, total cell proteins extracted from E. coli transformed with ptrpLE-E' were employed as an antigen in a Western
blot assay. Note that individual sera were first preadsorbed with a soluble extract of E. coli proteins (blocking extract),
although this was later found to be unnecessary.

In the Western blot assay, total cellular proteins were prepared from induced cultures of E. coli transformed with
ptrpLE-E' (Fig. 9a) or ptrpLE-gp41 (described above) and electrophoresed on SDS-10% polyacrylamide gels as
described above. Proteins were electrophoretically transferred to nitrocellulose sheets as described (55e). Individual blot
strips were incubated overnight at room temperature with a 1:200 dilution of the indicated sera in TBS buffer (0.025 M
Tris-HCl pH 7.2, 0.15 M NaCl and 0.05% Tween-20) containing 5% normal goat serum and 5 mcg/ml protein blocking
extract from E. coli transformed with pBR322. Blocking extract was prepared by sonicating bacteria in 0.01 M Tris-HCl
pH 7.5, 0.15 M NaCl and 0.5% NP40, and removing the insoluble residue by centrifugation at 10,000xg. Total cell
proteins prepared from E. coli transformed with pNCV or ptrpLE-E' were employed for preadsorption of sera with LE
or LE-E' proteins, respectively. After extensive washing in TBS buffer, the strips were successively incubated with
biotinylated goat anti-human or anti-rabbit IgG (Vector Labs), avidin conjugated horseradish peroxidase (Miles-Yeda)
and developed with peroxidase substrate (0.5 ng/ml 4-chloro-1-naphthol and 0.16% hydrogen peroxide in TBS).
Following each incubation, strips were washed extensively with TBS.

Sera from 37% of AIDS patients (17/46), 81% of ARC patients (21/26), and 39% of healthy homosexual men
(29/75) tested gave a clear reaction with the LE-E' immunoblots, while none of 37 sera from random blood donors was
found to react (Table 1 below). Identical results were obtained for a representative subset of these sera in Western
blot experiments in which total cell proteins from E. coli transformed with plasmid ptrpLE17-E', a derivative of
ptrpLE-E', were utilized as the antigen. These cells express LE17-E', a fusion protein that differs from LE-E' in that only residues
1-17 of the LE protein are present, indicating that the seroreactivity detected between AIDS-related sera and LE-E'
fusion proteins is specific for epitopes associated with E' sequences.



Table 1

Prevalence of antibodies to bacterial LE-E' in AIDS risk groups

Risk group   E'+     gp41+   E'+/gp41+   E'+/gp41-   E'-/gp41+   E'-/gp41-   WB+        WB+gp41+
AIDS         17/46   44/46   16/46       1/46        28/46       1/46 (a)    45/45      44/46
ARC          21/26   22/26   19/26       2/26        3/26        2/26        22/26      22/26
HHM          29/75   24/75   18/75       11/75       6/75        40/75       24/75      24/75
controls     0/37    0/37    0/37        0/37        0/37        0/37        N.D. (b)   N.D.

E', reactivity in LE-E' western blot assay; gp41, reactivity in LE-gp41 western blot assay (see Examples above);
WB, reactivity in whole virus western blot assay; HHM, healthy homosexual males; controls, random blood donor
samples.

(a) This individual was positive in an HTLV-III cultivation assay.

(b) All 37 control sera were seronegative in a commercial whole virus ELISA (Abbott).



Two major points are evident from this comparison of LE-E' and gp41 seroreactivity in different AIDS risk groups.
First, LE-E' seroreactivity is far less frequently detected than gp41 seroreactivity in AIDS patients (37% vs. 96%), but
with roughly the same frequency with sera from ARC patients (81% vs. 85%) and healthy homosexual males (39% vs.
32%). Secondly, a significant number of the healthy homosexual males were seropositive for LE-E' but not gp41 (11/75).
The fact that antibodies to E' are detectable sooner than antibodies to env antigens in a statistically significant number
(11/75) of healthy homosexual males, a group at high risk for developing AIDS and ARC, as well as a smaller number
of AIDS and ARC patients (3/72), was completely unexpected and surprising, particularly since no protein of the Mr of
E' has been identified as a virion component and, because E' contains no apparent secretory leader, it would not be
expected to find the protein in the serum. Seroconversion to env antigens was monitored by Western blots utilizing
either whole HTLV-III virus (55e) or a recombinant LE-gp41 antigen (supra), each assay having a reliability of >98%
in detecting sera from clinically diagnosed AIDS patients. Since many of the healthy homosexual men studied here
were seronegative for E' as well as virus antigens and consequently may not have been exposed to the virus, the
actual frequency of E'+gp41- individuals in this sample may be as high as 31% (11/35). These results may be related
to previous observations that HTLV-III can be isolated from the lymphocytes of symptom-free seronegative persons
(55m), and have considerable practical importance for the early clinical diagnosis of and screening for AIDS retrovirus
exposure.
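
The 31% figure above can be reproduced from the Table 1 counts for healthy homosexual males by excluding the 40 of 75 sera that reacted with neither antigen. The sketch below uses only those counts; the variable names are illustrative.

```python
# Counts for healthy homosexual males, taken from Table 1.
total_sera = 75
e_pos_gp41_neg = 11   # reactive with LE-E' but not gp41
double_negative = 40  # reactive with neither antigen

# Sera showing any evidence of exposure (reactive with at least one antigen).
presumed_exposed = total_sera - double_negative

raw_frequency = e_pos_gp41_neg / total_sera
adjusted_frequency = e_pos_gp41_neg / presumed_exposed

print(presumed_exposed)             # prints 35
print(f"{raw_frequency:.0%}")       # prints "15%"
print(f"{adjusted_frequency:.0%}")  # prints "31%"
```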

In contrast to the situation found for asymptomatic individuals with virus exposure and ARC patients, antibodies
to E' are detected at a much lower frequency than antibodies to gp41env in AIDS patients (37% vs. 96%). A similar
pattern of less frequent reactivity among AIDS patients has been previously noted for the core protein p24gag relative
to the major envelope protein gp120env in radioimmunoprecipitation analysis (55n). It is unclear whether this
phenomenon represents a differential susceptibility to developing AIDS; it more likely reflects the selective loss of antibody
titers or lower titers to E' and p24gag than to viral envelope antigens.

To obtain direct evidence for E' expression in HTLV-III-infected cells, high titer antisera specific for E' were prepared
by immunizing rabbits with LE-E' isolated by preparative SDS-polyacrylamide gel electrophoresis. The region of the
gel containing LE-E' (about 100 mcg) was excised, homogenized in incomplete Freund's adjuvant and administered
subcutaneously at two week intervals. The rabbit sera were screened for the ability to immunoprecipitate E'-related
proteins from extracts of HTLV-III-infected cells metabolically labelled with 35S-cysteine. The publicly available
H9/HTLV-IIIB cell line was employed for this purpose, since we had previously observed that these cells produce a high level
of 1.7-1.9 kb mRNAs potentially capable of encoding an E' translation product (~1% of polyA+ RNA) (55e). Following
four injections of antigen, sera obtained from two rabbits were capable of immunoprecipitating 5-6 distinct proteins of
Mr 23,500-28,000 from H9/HTLV-IIIB cells. Preimmune sera from the same animals failed to immunoprecipitate any
one of these proteins. Furthermore, the immune sera did not detect any of these proteins in uninfected H9 cells. The
proteins detected in H9/HTLV-IIIB cells by the immune sera included three major proteins of Mr 28,000, 25,500 and
25,000 and two minor proteins of Mr 26,500 and 23,500. In some experiments the Mr 23,500 protein could be resolved
into two distinguishable species.

LE-E' positive sera from 3 AIDS patients and 1 ARC patient were capable of immunoprecipitating proteins from
the HTLV-III infected cells of ~Mr 28,000 and 25,000 that were not precipitated by sera from control donors. The
immunoprecipitation of these proteins could be only partially displaced by LE-E' protein, however, suggesting that the
LE-E'+ AIDS and ARC sera recognize E' determinant(s) not displayed by the recombinant protein.

This rabbit antiserum contained antibody capable of binding AIDS-associated retroviral E' polypeptide but was
free of any antibody capable of binding any other AIDS-associated retroviral-encoded polypeptide, as well as being free
of bound E' polypeptide. It conventionally is immobilized to facilitate separations. For example, the antiserum or antibody
is bound or adsorbed to a polyolefin, e.g. polystyrene microtiter plate wells, or to matrices to which anti-rabbit IgG
(e.g. goat) has been preadsorbed.

The E' protein next was expressed in recombinant mammalian cell culture. Applicants' starting plasmid was obtained
from pFD11 (EP 117,060 A) by a complex procedure using certain plasmids conveniently available to applicants.
This procedure is not preferred for use by the art. Instead, the starting plasmid is preferably obtained from pFD11 by
the following method. pFD11 is digested with HindIII at its unique HindIII site and the vector recovered. An adaptor
having the sequence

AGCTTGGATCCTTTTTATCGATA
    ACCTAGGAAAAATAGCTATTCGA

is ligated to the vector fragment and transformed into E. coli 294. pFD11d is obtained from an ampicillin resistant colony.
The ligation of the adaptor into pFD11 introduces BamHI and ClaI sites immediately upstream from the mouse DHFR
gene. pFD11d is digested with BamHI and ClaI and the vector fragment isolated.
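The adaptor described above can be checked mechanically. The following is a minimal sketch, assuming that the two printed strands are written 5'-to-3' (upper) and 3'-to-5' (lower) and that each strand carries a four-nucleotide single-stranded end, as would be needed for ligation into a HindIII-cut vector. The function names and the `OVERHANG` constant are illustrative and are not taken from the specification.

```python
# Adaptor strands exactly as printed in the text.
TOP_5_TO_3 = "AGCTTGGATCCTTTTTATCGATA"
BOTTOM_3_TO_5 = "ACCTAGGAAAAATAGCTATTCGA"

# Assumption: each end carries a 4-nt 5' overhang (HindIII leaves 5'-AGCT).
OVERHANG = 4
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def duplex_is_complementary(top, bottom_3_to_5, overhang):
    """Check base pairing of the double-stranded core of the adaptor."""
    top_paired = top[overhang:]
    bottom_paired = bottom_3_to_5[: len(bottom_3_to_5) - overhang]
    return top_paired.translate(COMPLEMENT) == bottom_paired


def overhangs_5_to_3(top, bottom_3_to_5, overhang):
    """Return both single-stranded ends, each read 5'-to-3'."""
    left = top[:overhang]
    right = bottom_3_to_5[-overhang:][::-1]
    return left, right


def site_positions(top):
    """0-based start positions of the BamHI and ClaI recognition sequences."""
    return {"BamHI": top.find("GGATCC"), "ClaI": top.find("ATCGAT")}


if __name__ == "__main__":
    print(duplex_is_complementary(TOP_5_TO_3, BOTTOM_3_TO_5, OVERHANG))
    print(overhangs_5_to_3(TOP_5_TO_3, BOTTOM_3_TO_5, OVERHANG))
    print(site_positions(TOP_5_TO_3))
```

Under these assumptions the 19 bp core pairs exactly, both ends read AGCT (compatible with a HindIII-cut vector), and the upper strand contains one GGATCC (BamHI) and one ATCGAT (ClaI) recognition sequence, in agreement with the statement that the adaptor introduces BamHI and ClaI sites.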
H9c.53 was digested with BamHI and TaqI and the fragment (fragment 4) was isolated that contained the portion
of the env gene downstream from the BamHI site at nucleotide 8053, the E' coding region and its untranslated 3' region
through to nucleotide 232. The untranslated 3' region included the 3' long terminal repeat (LTR). The pFD11d vector
fragment (supra) was ligated to fragment 4 and E. coli 294 transfected. pSVE.E'DHFR was obtained from an ampicillin
resistant colony. The E'-encoding DNA thus was placed under the transcriptional control of the SV40 early promoter,
while the LTR provided sequences for the cleavage and polyadenylation of the E' gene transcripts together with
sequences for promoting transcription of the dhfr gene.

CHOdhfr-K1 DUX-B11 cells (55f) were grown on DMEM medium containing 10% fetal bovine serum. Transfections
with pSVE.E'DHFR were performed by the calcium phosphate precipitation method (55g) as described (55h). Following
a 6 hour exposure to the vector, the cells were shocked with 20% glycerol (v/v) in PBS (55i) and grown for 1-1/2 days
in nonselective medium. Cells were then passaged into selective medium (F12 medium lacking glycine, hypoxanthine,
and thymidine and supplemented with 10% dialysed fetal bovine serum) and refed every 2-3 days. A population of
approximately 300 resistant colonies was massed after two weeks. Immunoprecipitation analysis of a population of
the initial dhfr+ colonies revealed a barely detectable level of E'. This population was further amplified for pSVE.E'DHFR
by selection for growth in methotrexate (55j, 55k). Approximately 2x10⁵ cells were seeded into a 100 mm dish of
selective medium containing 10⁻⁷ M methotrexate and the media changed every 2 days. A population of approximately
50 resistant colonies arising after three weeks in this media (CHO/EM00) was massed and E' protein recovered as
follows. The cells were collected by centrifugation, washed with phosphate buffered saline (PBS) and lysed in 2 ml of
RIPA buffer (0.05 M Tris-HCl pH 7.5, 0.15 M NaCl, 1% Triton X-100, 1% deoxycholate and 0.1% sodium dodecyl sulfate)
containing 0.5% aprotinin. The extract was preincubated for two hours with 10 mcl. of preimmune rabbit sera at 4°C,
cleared twice with 50 mcl. of Pansorbin (Calbiochem), and incubated overnight with 2 mcl. of rabbit anti-LE-E' sera at
4°C. The immunoprecipitate was incubated for 30 minutes with 10 mcl. of Pansorbin, collected by centrifugation and
the pellet washed twice with RIPA buffer and once with water. The immunoprecipitate is solubilized and E' separated
from the rabbit antibody by ultrafiltration, electrophoresis or other conventional technique. On a commercial scale the
antibody or antisera is preinsolubilized and E' eluted from the immobilized immunoadsorbent using pH 3-5 buffer.

The population of colonies arising in 1x10⁻⁷ M methotrexate (CHO/EM00) produced significant amounts of two
E'-encoded proteins specifically immunoprecipitated by the rabbit sera to LE-E' but not preimmune rabbit sera. Neither
protein was detectable in the parent CHO cells with the immune sera. Furthermore, immunoprecipitation of both
E'-related proteins detected in CHO/EM00 cells was readily competed by the LE-E' protein but not the LE protein,
demonstrating that the reaction was specific for E' sequences.

A direct comparison shows that the two E'-related polypeptides made in CHO/EM00 cells have electrophoretic
mobilities identical with Mr 28,000 and 26,500 proteins observed in H9/HTLV-IIIB cells. In addition, the relative amounts
of these two proteins in CHO/EM00 and H9/HTLV-IIIB cells are essentially identical. However, it is uncertain whether
the 28,000 and 26,500 H9/HTLV-IIIB cell proteins are chemically identical to the E'-related recombinant polypeptides
produced herein. Pulse-labeling studies indicated that neither protein in CHO cell culture is the direct precursor of the
other. Approximately 80-90% of the Mr 28,000 and 26,500 E' protein was contained within the cytosolic fraction of
transfected CHO cells. Immunoprecipitation of cell supernatants from CHO/EM00 cultures did not reveal any secreted
E' proteins.

The recombinant E'-polypeptides are sterilized by passing a solution of the polypeptides through a 0.22 micron
filter in order to remove bacteria. The filtrate is formulated into a vaccine by further purifying the polypeptide, for example
by gel filtration, as desired and formulating the polypeptide into a pharmaceutically-acceptable carrier such as isotonic
saline, D5W and the like. The amount of E' protein employed will be sufficient upon s.c. injection, followed by i.v. boost,
to generate a detectable titer of anti-E' in the subject. This will necessarily turn on the immunocompetence of the test
subject, so the dose frequency and route of administration of antigen must be determined by the attending physician
through monitoring of the serum anti-E' levels. E'-containing vaccines optionally include other predetermined
AIDS-associated retroviral polypeptides such as gp41. This will be helpful in ensuring a complete potentiation of the immune
response to a potential AIDS-associated retroviral infection.

The role of the E' gene protein in viral reproduction and pathogenesis remains a major unanswered question raised
by these studies. Our ability to readily detect individuals who are seroreactive with LE-E' but not with whole virus
antigens on immunoblots suggests that E' is not a virion component, but could also be explained if E' were lost during
virus purification. Others have described unidentified proteins of Mr 25,000-28,000 in HTLV-III-infected H9 cells
immunoprecipitated by AIDS-related sera (55o,55n,55p), which may correspond to E' proteins, but the same polypeptides
were not detected in HTLV-III virion preparations (55o,55p). In CHO/EM00 cells most intracellular E' was found in
cytoplasmic rather than nuclear fractions, suggesting that E' does not participate in a nuclear event such as transcription
initiation. These studies must be considered preliminary, however, since they may not accurately reflect the location
of E' in infected cells.

Transcriptional mapping studies have suggested that E' may be an abundantly expressed protein in infected cells.
Approximately 20% of the viral RNA synthesized in H9/HTLV-IIIB cells represents a family of spliced, subgenomic
1.7-1.9 kb RNAs consisting of a 289 bp leader containing sequences 5' to gag, a middle exon located upstream of env,
and 1.3 kb of sequence from the 3' end of the genome containing the E' gene (55e). Either of two additional short
untranslated leaders from the pol or P' regions may also be present. Depending upon alternate utilization of two splice
acceptor sites (at nucleotides 5,359 and 5,558), a middle exon of either 268 bp or 69 bp is generated (55e, 55q). The
268 bp exon contains two AUG triplets not present in the 69 bp exon, one of which may serve as the initiator codon
for the tat reading frame (55q). In 1.7-1.9 kb mRNAs that contain the 69 bp middle exon the predicted E' AUG initiator
codon is preceded by a single AUG triplet, but we nonetheless expected it to be utilized efficiently for translation initiation
since the upstream AUG triplet is flanked by unfavorable nucleotides (55r) and is followed by an in-frame termination
codon well upstream of the E' AUG initiator (55s). Our ability to obtain efficient expression of E' proteins in CHO cells
with a plasmid containing the upstream AUG triplet (pSVE.E'DHFR) confirmed our expectation. Alternative usage of
the two middle exons within the 4.3 kb viral mRNA class (55e) similarly determines whether the tat reading frame
precedes the env gene. If differential splicing is the mechanism for generating mRNAs encoding either the E' and env
proteins or the tat reading frame, it may represent a crucial regulatory event in the reproduction of the AIDS retrovirus.
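The splice-site figures quoted above are internally consistent, which can be shown with a short calculation. This is only an illustrative sketch: it assumes that the two alternative middle exons end at the same downstream splice donor, so that the difference in exon length should equal the spacing between the two acceptor sites. The constant and function names are hypothetical.

```python
# Values quoted in the text: acceptor positions (nucleotides) and exon lengths (bp).
ACCEPTOR_UPSTREAM = 5359
ACCEPTOR_DOWNSTREAM = 5558
EXON_LONG_BP = 268
EXON_SHORT_BP = 69


def acceptor_spacing(upstream, downstream):
    """Distance in nucleotides between the two alternative splice acceptors."""
    return downstream - upstream


def exon_length_difference(long_bp, short_bp):
    """Difference in length between the two alternative middle exons."""
    return long_bp - short_bp


def figures_are_consistent():
    # Assumption: both exons share one 3' boundary (a common splice donor),
    # so the two differences should be equal.
    spacing = acceptor_spacing(ACCEPTOR_UPSTREAM, ACCEPTOR_DOWNSTREAM)
    difference = exon_length_difference(EXON_LONG_BP, EXON_SHORT_BP)
    return spacing == difference


if __name__ == "__main__":
    print(acceptor_spacing(ACCEPTOR_UPSTREAM, ACCEPTOR_DOWNSTREAM))
    print(exon_length_difference(EXON_LONG_BP, EXON_SHORT_BP))
    print(figures_are_consistent())
```

Both differences come to 199 nucleotides, so use of the upstream acceptor adds 199 nucleotides (including the two additional AUG triplets) to the middle exon.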

E' polypeptides or antibodies thereto are employed in conventional assay procedures as are generally described
above. For assay of the E' polypeptide in the body fluids of test subjects (serum, saliva, plasma, urine, etc.), a
"competitive" type test kit preferably will contain labeled (enzyme, radioisotope, fluorescent group, etc.) E' polypeptide such
as LE-E' or either species of E' described above, antibody capable of binding E' polypeptide and, optionally, other
conventional reagents such as incubation and washing buffers. This test kit is employed in methods known per se for
use with the selected label, e.g. ELISA, radioimmunoassay or fluorescence polarization. Similarly, conventional
"sandwich" assays also are useful in determining the presence of E' polypeptide or its antibody in test subject samples.
Assays for the E' protein are unusual for AIDS diagnosis because they are the first known that are directed at determining
a protein which is not present in purified preparations of the AIDS virion but which is encoded by the viral
genome and apparently expressed by host cells in the course of a viral infection and/or replication in vivo. Detection
of such proteins is a preferred diagnostic approach because they are believed to constitute the first indicia of viral
infection, preceding the accumulation of virion antigens such as env and the generation of host antibodies directed
against HTLV-III-encoded proteins.

It also is within the scope of this invention to assay body fluids for DNA or RNA encoding the E' protein using
standard hybridization assays, e.g. Southern or Northern assays, respectively. DNA or RNA is used in such assays
that is merely capable of hybridizing with the Fig. 2 E'-encoding sequence; it need not encode an E' protein. Such
nucleic acids are readily synthesized as oligonucleotides and then screened for the ability to hybridize to the
E'-polypeptide encoding DNA set forth in Fig. 2. The diagnostic nucleic acid is labelled in conventional fashion using detectable
tags such as radiophosphorus and the like.

EXAMPLE 8

Recombinant Synthesis of a Secreted Form of An HTLV-III Retrovirus Envelope Protein In Mammalian Cells

Figure 11a illustrates the viral genome with the location of each of the five major open reading frames of the virus.
The env reading frame encodes the viral envelope antigen utilized here. An envelope fragment encoding amino acid
residues 61-531 was used for expression of a secreted envelope protein. This fragment was ligated to a vector which
contained the components necessary for proper expression of this integrated envelope gene as well as for selection
in mammalian cells. "SV40 ori" contains an early promoter from the SV40 virus ("E") which is utilized to drive transcription
of either the AIDS retrovirus envelope or the dihydrofolate reductase (dhfr) gene. Transcription termination and
message polyadenylation are accomplished using signals derived from the 3' non-translated region of the hepatitis B
virus surface antigen gene ("HBVs Ag poly A signal"). Growth in E. coli is accomplished by the inclusion of the ampicillin
resistance gene ("Ap R") of pBR322 as well as the origin of replication of this plasmid. Finally, the murine dhfr gene
is utilized as a selectable marker for transfection and selection in Chinese hamster ovary (CHO) cells which lack this
gene.


A. Construction of Expression Vector 

The contemplated method for assembling this vector from publicly available starting plasmids is as follows, this
being a modification of that method which was actually used. pE348HBV E400D22 (Simonsen et al., 1983, "Proc. Nat.
Acad. Sci.-USA" 80: 2495) is digested with Hpa I and the linearized plasmid recovered. The Hpa I cohesive termini are
blunted with the Klenow fragment of DNA polymerase I and vector fragment 1 recovered. A blunt-ended duplex stop
linker having the coding sequence TCTAGAGGATCCCCAA CTAA G TAA GAT CTAG is ligated to vector fragment 1, the
ligation mixture transfected into E. coli 294 and the vector recovered from an Amp r colony. The underlined sequences
represent stop codons in each of the three possible reading frames. The inclusion of the stop linker introduces a
synthetic nine-residue polypeptide at the C terminus of the env protein.
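The statement that the stop linker carries a stop codon in each reading frame can be verified by scanning the printed sequence. The sketch below assumes that the spaces in the printed linker are residue of the original underlining and that the sequence is otherwise continuous; it uses only the three standard stop codons (TAA, TAG, TGA). The function name `first_stop` is illustrative.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

# Linker as printed, with the spaces removed (assumed to be underline residue).
LINKER = "TCTAGAGGATCCCCAACTAAGTAAGATCTAG"


def first_stop(seq, frame):
    """Return (sense codons before the stop, 0-based index of the stop codon)
    for the first in-frame stop codon, or None if the frame stays open."""
    for count, i in enumerate(range(frame, len(seq) - 2, 3)):
        if seq[i:i + 3] in STOP_CODONS:
            return count, i
    return None


if __name__ == "__main__":
    for frame in range(3):
        print(frame, first_stop(LINKER, frame))
```

Every frame terminates within the linker. The frame beginning at the second nucleotide (CTA GAG GAT ...) reads through nine sense codons before reaching TAG, which is consistent with the nine-residue C-terminal extension mentioned above if the env coding sequence joins the linker in that frame; that reading is an inference and is not stated in the text.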

The stop-linkered vector is digested with XbaI, blunted with Klenow, digested with EcoRI and the 5706 bp vector
fragment 2 recovered. This digestion serves to remove the hepatitis B surface antigen coding sequence.

Clone H9c.53 described above which contained HTLV-III retroviral env cDNA is digested with HhaI, blunted with
Klenow, digested with EcoRI and fragment 3, spanning nucleotides 5324 (the EcoRI site) to 7394 (the HhaI site at env
amino acid 531), is recovered. Fragment 3 and vector fragment 2 are ligated, transfected into E. coli and vector 4
recovered from an ampicillin-resistant colony.

Vector 4 was digested with NdeI (the NdeI site corresponding to amino acid 61 of the env protein), blunted with
Klenow, digested with PstI and vector fragment 5 recovered. This step serves to remove the sequence located between
a nucleotide located within the Amp r gene in the 3' direction to residue 61 of the env protein, thereby removing the
retroviral signal and 61 N-terminal residues of the env protein.

pgDtruncDHFR (EP 139,417A) is digested with PstI and PvuII and 1514 bp fragment 6 recovered containing the
3' portion of the Amp r gene (corresponding to the sequence removed from vector 4 in the previous step), an SV40
origin, the 25 residue herpes gD protein signal and the first 25 N-terminal residues of the mature gD protein. Fragments
5 and 6 are ligated, the ligation mixture transformed into E. coli 294 and pAIDSenvtrDHFR was recovered from an
Amp r colony.

The skilled artisan will appreciate that the above described vector for the expression of a p41-viral protein fusion
is suitably modified for expression of other viral polypeptides than those of herpes simplex. For example, a fusion with
hepatitis surface antigen is readily accomplished by inserting DNA encoding the mature truncated or intact p41 protein
in place of the surface antigen stop codon, or vice versa, using for example pE348 HBV as a starting plasmid. Selection
of appropriate restriction sites and synthetic adaptors as required will be within the skill of the ordinary artisan.

Other signal sequences than the herpes gD signal are used as well. Suitable signals include those of secreted 
eukaryotic (preferably, non-primate) polypeptides or mammalian host range viruses which are known per se. 


B. Transfection and Selection of Mammalian Cell Lines 

On the day prior to transfection, cells from a confluent 10 cm² dish of CHO DHFR- cells were split 1:10 into two 10 cm²
dishes in F12/DMEM (Dulbecco's modified Eagle's medium), 10% FBS (fetal bovine serum) + GHT (glutamine,
hypoxanthine, thymidine) + pen-strep antibiotics and grown overnight at 37°C.

Preparation of Plasmid for Transfection. 

75 λ 10 mM Tris, 1 mM EDTA, 10 µg plasmid DNA (omitting DNA for mock transfection), 65 λ CaCl2 and 0.65 ml
2x HEPES buffer, pH 7.1 were carefully mixed at room temperature, and thereafter poured directly on the CHO cells with
media. Cells and DNA were incubated at 37°C for 3 hours. The media were then aspirated and the CHO cell monolayers
shocked with 20% glycerol-PBS (2.5 ml) for 45 sec. The dishes were flooded with 7.8 ml of F12/DMEM, aspirated
and replaced with 12 ml F12/DMEM. Cells were incubated at 37°C for 2 days and then split 1:2 and 1:10 in F12 media
containing 7% XDZ FBS (extensively dialyzed fetal bovine serum), with pen-strep. The cells were fed again on day 4.


Selection and Amplification of New Cell Line. 

On day 7 the mock transfected cells were all dead. The transfected CHO cells split 1:2 had grown to confluence.
They were split 1:3 into three 10 cm² dishes containing F12/DMEM, 7% XDZ, 50 nM MTX (methotrexate), 100 nM MTX
and 250 nM MTX. The dishes were refed every 4 days until confluent monolayers had adapted to MTX. These
monolayers were then labelled in ³⁵S-met and assayed by immunoprecipitation for the secreted env fusion.

Cells diluted 1:10 in selection media had 50-100 colonies at day 10. The colonies were picked and clones grown
up in F12/DMEM. These clones were screened by radioimmunoprecipitation with anti-env antisera. Positive clones
were amplified in increasing concentrations of MTX (500, 1000 and 3000 nM) until maximum secretion of env fusion
was observed.

C. Radioimmunoprecipitation Analysis of Transfected CHO Cells 

Confluent 10 cm dishes were labelled with 66 µCi/ml ³⁵S-methionine for 4 hr at 37°C in RPMI 1640 medium lacking
methionine. The media were then collected and aprotinin was added (15 µl). Cells were removed by centrifugation.
One ml of supernatant was added to 100 µl of 2.5 M NaCl, 2% Tween 80, 1 mg/ml BSA. Two µl of high titre human
anti-AIDS antibody was added and incubated 1 hr at room temp. Protein A Sepharose, pre-adsorbed with non-transfected
CHO protein lysates, was added (50 µl) and the mixture incubated 30 minutes at 37°C. The Sepharose beads
were then washed by centrifugation 3 times in PBS, 0.2% deoxycholate, 0.2% Tween 20. A final water wash was
followed by resuspension of the Sepharose beads in a standard SDS PAGE sample buffer. The samples were boiled
5 minutes, centrifuged, and the supernatants were separated on SDS polyacrylamide gels. The gels were then
autoradiographed using Enlighten 30.

Cells producing large amounts of envelope synthesized a ~120,000 Mr protein which was specifically
immunoprecipitable by anti-HTLV-III retrovirus antibodies. This protein was found only in plasmid transfected cells. The intracellular
form of the antigen was approximately 100,000 daltons in size, and it could be chased, in a pulse-chase experiment,
into 120,000 dalton, secreted protein. This protein was also found to react on Western blots when incubated with
anti-AIDS retrovirus antibodies.


D. Culture of Amplified Transformant CHO Cells 

One confluent 10 cm² dish of transformants from step B was incubated with trypsin to free the cells and transferred
to a small roller bottle in 50 ml F12/DMEM 50:50 media w/o GHT, low glucose, minus glycine +7% XDZ FBS. CO2
was introduced into the bottle for 5 sec. and the bottle was capped tightly. The bottle was rolled at 37°C and observed daily until
confluent (~3 days).

The medium was decanted, cells rinsed once in 50 ml PBS, and trypsinized for 5 min. at 37°C with 5 ml trypsin
in EDTA. Cells were suspended in 5 ml of F12/DMEM 50:50 plus 7% XDZ FBS and the suspension placed into 850
cm² roller bottles. This procedure is continued until the desired number of bottles is obtained. When the desired number
of bottles reached about 85% confluency, they were rinsed twice in 100 ml PBS and 125 ml serum-free F12/DMEM
3:1 w/o GHT, low glucose, minus glycine added to each bottle. The bottles were incubated at 37°C for 3 days. On the
third day the media were poured off and retained. This process was repeated twice. The ~375 ml of medium collected
from each bottle was centrifuged at 2000 rpm for 5 min and Na azide added to inhibit any contaminant microbial growth.

E. Purification of Recombinant HTLV III Envelope Protein

E. I. Preparation of an Affinity Column for the Isolation of Recombinant HTLV III Polypeptides. 

Sera from patients diagnosed with AIDS related complex (ARC) were tested for the presence of antibodies reactive
with HTLV III proteins by "Western" immunoblot analysis. Plasma from individuals exhibiting high titers to HTLV III
envelope protein were collected, and immunoglobulins were purified by affinity chromatography using Protein A-Sepharose
CL-4B (Pharmacia Fine Chemicals) according to the manufacturer's directions. The purification procedure yielded
material that was approximately 90% pure as judged by polyacrylamide gel electrophoresis (PAGE) visualized by silver
staining. The purified immunoglobulins were then covalently linked to Sepharose 4B using CNBr-activated Sepharose
4B (Pharmacia Fine Chemicals) according to the manufacturer's instructions.

E.II. Purification of Recombinant HTLV III Envelope Protein Fusion.

Serum free cell culture medium from the method of step D was harvested, clarified by passage through a 0.45
micron Nalgene filter, and concentrated 50-fold by ultrafiltration using an Amicon YM-10 ultrafiltration membrane. The
concentrated medium was dialyzed against phosphate buffered saline (PBS), and applied to an affinity column of the
type described above in E.I. After application of the concentrated cell culture medium, the column effluent was
monitored spectroscopically at 280 nm and the column was washed until the effluent was free of material absorbing light
at this wavelength. Proteins retained on the affinity column were eluted by treating the column with 0.1 M acetic acid,
0.5 M NaCl elution buffer. Material eluted from the column was monitored spectroscopically, and was immediately
treated with 1 M Tris buffer to adjust the pH to near neutrality (pH 7-8). The eluted protein was dialyzed against PBS
and concentrated approximately 5 fold by ultrafiltration with the use of an Amicon YM-10 membrane. The resulting
protein was analyzed by PAGE and visualized by silver staining and "Western" immunoblotting. Based on these
analyses, the secreted truncated recombinant HTLV III env fusion exhibited a molecular weight of approximately 130 kd,
and represented from 5-25% of the total protein eluted from the column, depending on the preparation.

E.III. Immunization of Rabbits with Affinity Purified Recombinant HTLV III Envelope Protein Fusion.

Rabbits were immunized with 30-50 micrograms of total protein (prepared as described above) per immunization.
For the primary immunization, 30-50 micrograms of protein in 1 ml of PBS was emulsified with an equal volume of
complete Freund's adjuvant and injected as follows: 0.5 ml in each rear footpad, 0.2 ml injected at 5 intradermal sites
along the back. The rabbits were then boosted 14-21 days after the primary immunization. The antigen emulsion for
the booster immunizations was prepared the same way as that for the primary immunization with the exception that
incomplete Freund's adjuvant replaced complete Freund's adjuvant. For the booster immunizations each rabbit
received a total of 2 ml of antigen emulsion distributed as follows: 0.5 ml intramuscularly in each thigh, and 1 ml distributed
among 5 intradermal sites along the back (0.2 ml per site). Animals were bled via the ear vein or cardiac puncture 10
to 14 days after each boost. Antibodies elicited against the recombinant HTLV III envelope protein were identified by
"Western" immunoblot analysis using recombinant HTLV III 130K protein as the antigen. In the first experiment, one
of two rabbits formed antibodies reactive with the recombinant HTLV III protein after one booster immunization. The
second rabbit required additional booster immunizations to elicit the formation of anti-rHTLV III antibodies.

F. Neutralization of HTLV III Retrovirus Infection 


Various sera from either control rabbits, rabbits inoculated with the recombinant envelope antigen, normal control
patients, or patients with known retrovirus neutralizing antibodies were heat inactivated at 56°C for 30 minutes. The
serum samples were then mixed 1:1 with virus in wells containing RPMI 1640, 20% fetal calf serum, pen-strep, and
2 µg/ml polybrene. The wells were incubated 1 hour at 4°C, after which the plates were returned to room temperature
for 15 minutes. 1x10⁶ H9 human T cells were then added to each well, and the wells were incubated in a CO2 incubator.
The cultures were split 1:1 on day 4. Cells were assayed for reverse transcriptase and immunofluorescence on day 7
or 8. Immunofluorescence assays were done on fixed cells using human antisera known to contain AIDS retrovirus
antibodies. The percentage of cells which were fluorescent was a measure of the percentage of cells which were
infected by the virus. Reverse transcriptase assays of cell supernatants were done as described. The reduction in
reverse transcriptase levels was indicative of virus neutralization. When either control human or rabbit sera were mixed
with the virus, approximately 70-80% of the cells were fluorescent. Reverse transcriptase assays gave approximately
1.6 x 10⁶ cpm incorporated. When serum from a rabbit vaccinated with recombinant envelope antigen was analyzed,
the percentage of cells fluorescing was 35% and the reverse transcriptase levels were 285,000 cpm. Human serum
known to neutralize the virus gave 0% fluorescence and 10,000 cpm reverse transcriptase. These results indicate
neutralization with the antiserum from rabbits vaccinated with the recombinant antigen.
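The extent of neutralization implied by these figures can be expressed as a percentage reduction in reverse transcriptase activity relative to the control sera. The following sketch uses only the counts quoted above; the dictionary layout and function names are illustrative, and the control value is the approximate figure of 1.6 x 10⁶ cpm given in the text.

```python
# Reverse transcriptase counts (cpm) quoted in the text.
CONTROL_RT_CPM = 1.6e6  # approximate value reported for control sera
SAMPLE_RT_CPM = {
    "rabbit serum, recombinant envelope antigen": 285_000,
    "human neutralizing serum": 10_000,
}


def percent_reduction(control, sample):
    """Percentage by which the sample value falls below the control value."""
    return 100.0 * (control - sample) / control


def rt_reductions(control_cpm, samples):
    """Map each serum to its percentage reduction in RT activity."""
    return {name: percent_reduction(control_cpm, cpm) for name, cpm in samples.items()}


if __name__ == "__main__":
    for name, reduction in rt_reductions(CONTROL_RT_CPM, SAMPLE_RT_CPM).items():
        print(f"{name}: {reduction:.1f}% reduction")
```

On these numbers the serum from the vaccinated rabbit reduced reverse transcriptase activity by roughly 82%, compared with roughly 99% for the human serum known to neutralize the virus.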

Although the foregoing refers to particular preferred embodiments, it will be understood that the present invention 
is not so limited. It will occur to those ordinarily skilled in the art that various modifications may be made to the disclosed 
embodiments and that such modifications are intended to be within the scope of the present invention. 

The references cited herein, or grouped in the following bibliography and respectively cited parenthetically by
number in the foregoing text, are hereby incorporated by reference.

Claims

Claims for the following Contracting States : BE, CH, DE, FR, GB, IT, LI, LU, NL, SE 

1. A fusion polypeptide comprising (a) a first polypeptide sequence of an AIDS associated retrovirus E', env or gag
polypeptide having at least one antigenic determinant capable of specifically binding complementary antibody and
(b) a second polypeptide sequence which is not an AIDS associated virus polypeptide; wherein said E' polypeptide
of AIDS-associated retrovirus is defined as the 206 residue polypeptide designated "E'" in Fig. 2, its naturally-occurring
alleles, or its amino acid sequence variants which are immunologically cross-reactive with antisera capable
of binding the E' polypeptide produced in cells infected with AIDS-associated retrovirus.

2. The fusion polypeptide of claim 1 wherein the amino terminus of the first polypeptide sequence is fused to the
carboxyl terminus of the second polypeptide sequence.

3. The fusion polypeptide of claim 1 wherein the second polypeptide is a eukaryotic polypeptide.

4. The fusion polypeptide of claim 3 wherein the eukaryotic polypeptide is an interferon, insulin or human growth
hormone sequence.

5. The fusion of claim 3 wherein the second polypeptide is not a yeast polypeptide.

6. The fusion of claim 3, 4 or 5 wherein the eukaryotic polypeptide is positioned at the amino terminus, carboxy
terminus, or inserted within the AIDS associated polypeptide sequence.

7. The fusion of claim 1 wherein the second polypeptide is a viral antigen.

8. The fusion of claim 7 wherein the viral antigen is a herpes simplex antigen.

9. The fusion of claim 7 wherein the second polypeptide is a prokaryotic polypeptide. 

10. The fusion of any of claims 1-9 wherein the retrovirus polypeptide is gp120 or gp160.

11. The fusion of any of claims 1-10 wherein the second polypeptide is capable of inducing an autoimmune response
in humans.

12. The fusion of any of claims 1-10 wherein the second polypeptide is not capable of inducing an autoimmune
response in humans.

13. The fusion of any of claims 1-12 wherein the retrovirus polypeptide is an E' sequence.

14. The fusion of any of claims 1-13 which is unglycosylated.

15. A composition consisting essentially of the fusion of any of claims 1-14 and a pharmaceutically acceptable vehicle.



Claims for the following Contracting State : AT 


1. A process which comprises the preparation of a fusion polypeptide comprising (a) a first polypeptide sequence of
an AIDS associated retrovirus E', env or gag polypeptide having at least one antigenic determinant capable of
specifically binding complementary antibody and (b) a second polypeptide sequence which is not an AIDS associated
virus polypeptide; wherein said E' polypeptide of AIDS-associated retrovirus is defined as the 206 residue
polypeptide designated "E'" in Fig. 2, its naturally-occurring alleles, or its amino acid sequence variants which are
immunologically cross-reactive with antisera capable of binding the E' polypeptide produced in cells infected with
AIDS-associated retrovirus.

2. The process of claim 1 wherein the amino terminus of the first polypeptide sequence is fused to the carboxyl 
terminus of the second polypeptide sequence.

3. The process of claim 1 wherein the second polypeptide is a eukaryotic polypeptide. 

4. The process of claim 3 wherein the eukaryotic polypeptide is an interferon, insulin or human growth hormone 
sequence. 

5. The process of claim 3 wherein the second polypeptide is not a yeast polypeptide. 

6. The process of claim 3, 4 or 5 wherein the eukaryotic polypeptide is positioned at the amino terminus, carboxy 
terminus, or inserted within the AIDS associated polypeptide sequence. 

7. The process of claim 1 wherein the second polypeptide is a viral antigen. 

8. The process of claim 7 wherein the viral antigen is a herpes simplex antigen. 

9. The process of claim 7 wherein the second polypeptide is a prokaryotic polypeptide. 

10. The process of any of claims 1-9 wherein the retrovirus polypeptide is gp120 or gp160.

11. The process of any of claims 1-10 wherein the second polypeptide is capable of inducing an autoimmune response
in humans. 

12. The process of any of claims 1-10 wherein the second polypeptide is not capable of inducing an autoimmune 
response in humans. 

13. The process of any of claims 1-12 wherein the retrovirus polypeptide is an E' sequence. 

14. The process of any of claims 1-13 which is unglycosylated.

15. A process according to any of claims 1-14 including admixture of the fusion polypeptide with a pharmaceutically
acceptable vehicle.



Patentansprüche

Patentansprüche für folgende Vertragsstaaten : BE, CH, DE, FR, GB, IT, LI, LU, NL, SE

1. Fusions- bzw. Verschmelzungspolypeptid, umfassend (a) eine erste Polypeptidsequenz eines AIDS-assoziierten
Retrovirus E'-, env- oder gag-Polypeptids, das zumindest eine antigenische Determinante aufweist, die zur spezifischen
Bindung von komplementärem Antikörper fähig ist, und (b) eine zweite Polypeptidsequenz, die kein AIDS-assoziiertes
Virus-Polypeptid ist; worin das E'-Polypeptid von AIDS-assoziiertem Retrovirus als das in Fig. 2 mit
"E'" bezeichnete 206-Reste-Polypeptid, seine natürlich auftretenden Allelen oder seine Aminosäuresequenzvarianten
definiert ist, die mit Antiseren immunologisch kreuzreaktiv sind, die zur Bindung des E'-Polypeptids fähig
sind, das in mit AIDS-assoziiertem Retrovirus infizierten Zellen erzeugt wird.

2. Fusionspolypeptid nach Anspruch 1, worin der Aminoterminus der ersten Polypeptidsequenz mit dem Carboxyl- 
terminus der zweiten Polypeptidsequenz verschmolzen ist. 

3. Fusionspolypeptid nach Anspruch 1, worin das zweite Polypeptid ein eukaryotisches Polypeptid ist.

4. Fusionspolypeptid nach Anspruch 3, worin das eukaryotische Polypeptid eine Interferon-, Insulin- oder humane
Wachstumshormon-Sequenz ist.

5. Fusionspolypeptid nach Anspruch 3, worin das zweite Polypeptid kein Hefepolypeptid ist. 

6. Fusionspolypeptid nach Anspruch 3, 4 oder 5, worin das eukaryotische Polypeptid am Aminoterminus oder
Carboxyterminus positioniert oder innerhalb der AIDS-assoziierten Polypeptidsequenz eingefügt ist.

7. Fusionspolypeptid nach Anspruch 1, worin das zweite Polypeptid ein virales Antigen ist.

8. Fusionspolypeptid nach Anspruch 7, worin das virale Antigen ein Herpes simplex-Antigen ist.

9. Fusionspolypeptid nach Anspruch 7, worin das zweite Polypeptid ein prokaryotisches Polypeptid ist.

10. Fusionspolypeptid nach einem der Ansprüche 1 bis 9, worin das Retrovirus-Polypeptid gp120 oder gp160 ist.

11. Fusionspolypeptid nach einem der Ansprüche 1 bis 10, worin das zweite Polypeptid fähig ist, bei Menschen eine
Autoimmunresponse herbeizuführen.

12. Fusionspolypeptid nach einem der Ansprüche 1 bis 10, worin das zweite Polypeptid nicht fähig ist, bei Menschen
eine Autoimmunresponse herbeizuführen.

13. Fusionspolypeptid nach einem der Ansprüche 1 bis 12, worin das Retrovirus-Polypeptid eine E'-Sequenz ist.

14. Fusionspolypeptid nach einem der Ansprüche 1 bis 13, das unglykosyliert ist.

15. Zusammensetzung, die im wesentlichen aus dem Fusionspolypeptid nach einem der Ansprüche 1 bis 14 und einem
pharmazeutisch verträglichen Vehikel besteht.

Patentansprüche für folgenden Vertragsstaat : AT

1. Verfahren, das die Herstellung eines Fusions- bzw. Verschmelzungspolypeptids umfaßt, das umfaßt: (a) eine erste
Polypeptidsequenz eines AIDS-assoziierten Retrovirus E'-, env- oder gag-Polypeptids, das zumindest eine antigenische
Determinante aufweist, die zur spezifischen Bindung von komplementärem Antikörper fähig ist, und (b)
eine zweite Polypeptidsequenz, die kein AIDS-assoziiertes Virus-Polypeptid ist; worin das E'-Polypeptid von
AIDS-assoziiertem Retrovirus als das in Fig. 2 mit "E'" bezeichnete 206-Reste-Polypeptid, seine natürlich auftretenden
Allelen oder seine Aminosäuresequenzvarianten definiert ist, die mit Antiseren immunologisch kreuzreaktiv sind,
die zur Bindung des E'-Polypeptids fähig sind, das in mit AIDS-assoziiertem Retrovirus infizierten Zellen erzeugt
wird.

2. Verfahren nach Anspruch 1, worin der Aminoterminus der ersten Polypeptidsequenz mit dem Carboxylterminus 
der zweiten Polypeptidsequenz verschmolzen wird. 

3. Verfahren nach Anspruch 1, worin das zweite Polypeptid ein eukaryotisches Polypeptid ist.

4. Verfahren nach Anspruch 3, worin das eukaryotische Polypeptid eine Interferon-, Insulin- oder humane
Wachstumshormon-Sequenz ist.

5. Verfahren nach Anspruch 3, worin das zweite Polypeptid kein Hefepolypeptid ist.

6. Verfahren nach Anspruch 3, 4 oder 5, worin das eukaryotische Polypeptid am Aminoterminus oder Carboxyterminus
positioniert oder innerhalb der AIDS-assoziierten Polypeptidsequenz eingefügt wird.

7. Verfahren nach Anspruch 1, worin das zweite Polypeptid ein virales Antigen ist.

8. Verfahren nach Anspruch 7, worin das virale Antigen ein Herpes simplex-Antigen ist.

9. Verfahren nach Anspruch 7, worin das zweite Polypeptid ein prokaryotisches Polypeptid ist.

10. Verfahren nach einem der Ansprüche 1 bis 9, worin das Retrovirus-Polypeptid gp120 oder gp160 ist.

11. Verfahren nach einem der Ansprüche 1 bis 10, worin das zweite Polypeptid fähig ist, bei Menschen eine
Autoimmunresponse herbeizuführen.

12. Verfahren nach einem der Ansprüche 1 bis 10, worin das zweite Polypeptid nicht fähig ist, bei Menschen eine
Autoimmunresponse herbeizuführen.

13. Verfahren nach einem der Ansprüche 1 bis 12, worin das Retrovirus-Polypeptid eine E'-Sequenz ist.

14. Verfahren nach einem der Ansprüche 1 bis 13, worin das Fusionspolypeptid unglykosyliert ist.

15. Verfahren nach einem der Ansprüche 1 bis 14, das ein Gemisch aus dem Fusionspolypeptid und einem
pharmazeutisch verträglichen Vehikel einschließt.



Revendications 



Revendications pour les Etats contractants suivants : BE, CH, DE, FR, GB, IT, LI, LU, NL, SE 

1. Polypeptide de fusion comprenant (a) une première séquence polypeptidique d'un rétrovirus associé au SIDA E',
env ou gag portant au moins un déterminant antigénique capable de se lier de manière spécifique à un anticorps
complémentaire et (b) une seconde séquence polypeptidique qui n'est pas un polypeptide du virus associé au
SIDA; dans lequel ledit polypeptide E' du rétrovirus associé au SIDA est défini comme étant le polypeptide de 206
résidus désigné par "E'" à la Figure 2, ses variants alléliques existant à l'état naturel, ou ses variants de séquence
d'acides aminés qui présentent une réaction immunologique croisée avec un antisérum capable de se lier au
polypeptide E' produit dans des cellules infectées par le rétrovirus associé au SIDA.

2. Polypeptide de fusion selon la revendication 1 dans lequel l'extrémité amino-terminale de la première séquence
polypeptidique est fusionnée à l'extrémité carboxy-terminale de la seconde séquence polypeptidique.

3. Polypeptide de fusion selon la revendication 1 dans lequel le second polypeptide est un polypeptide eucaryote. 

4. Polypeptide de fusion selon la revendication 3 dans lequel le polypeptide eucaryote est une séquence de
l'interféron, de l'insuline ou de l'hormone de croissance humaine.

5. Produit de fusion selon la revendication 3 dans lequel le second polypeptide n'est pas un polypeptide de levure. 

6. Produit de fusion selon la revendication 3, 4 ou 5 dans lequel le polypeptide eucaryote est situé à l'extrémité
amino-terminale, carboxy-terminale, ou est inséré à l'intérieur de la séquence polypeptidique associée au SIDA.

7. Produit de fusion selon la revendication 1 dans lequel le second polypeptide est un antigène viral.

8. Produit de fusion selon la revendication 7 dans lequel l'antigène viral est un antigène de l'herpès.

9. Produit de fusion selon la revendication 7 dans lequel le second polypeptide est un polypeptide procaryote.

10. Produit de fusion selon l'une quelconque des revendications 1 à 9 dans lequel le polypeptide rétroviral est la gp120
ou la gp160.

11. Produit de fusion selon l'une quelconque des revendications 1 à 10 dans lequel le second polypeptide est capable
d'induire une réponse autoimmune chez les êtres humains.

12. Produit de fusion selon l'une quelconque des revendications 1 à 10 dans lequel le second polypeptide est incapable
d'induire une réponse autoimmune chez les êtres humains.

13. Produit de fusion selon l'une quelconque des revendications 1 à 12 dans lequel le polypeptide rétroviral est une
séquence E'.

14. Produit de fusion selon l'une quelconque des revendications 1 à 13 qui est non glycosylé.

15. Composition constituée essentiellement du produit de fusion selon l'une quelconque des revendications 1 à 14 et
d'un véhicule pharmaceutiquement acceptable.

Revendications pour l'Etat contractant suivant : AT

1. Procédé qui comprend la préparation d'un polypeptide de fusion comprenant (a) une première séquence
polypeptidique d'un rétrovirus associé au SIDA E', env ou gag portant au moins un déterminant antigénique capable de
se lier de manière spécifique à un anticorps complémentaire et (b) une seconde séquence polypeptidique qui n'est
pas un polypeptide du virus associé au SIDA; dans lequel ledit polypeptide E' du rétrovirus associé au SIDA est
défini comme étant le polypeptide de 206 résidus désigné par "E'" à la Figure 2, ses variants alléliques existant à
l'état naturel, ou ses variants de séquence d'acides aminés qui présentent une réaction immunologique croisée
avec un antisérum capable de se lier au polypeptide E' produit dans des cellules infectées par le rétrovirus associé
au SIDA.

2. Procédé selon la revendication 1 dans lequel l'extrémité amino-terminale de la première séquence polypeptidique
est fusionnée à l'extrémité carboxy-terminale de la seconde séquence polypeptidique.

3. Procédé selon la revendication 1 dans lequel le second polypeptide est un polypeptide eucaryote.

4. Procédé selon la revendication 3 dans lequel le polypeptide eucaryote est une séquence de l'interféron, de l'insuline
ou de l'hormone de croissance humaine.

5. Procédé selon la revendication 3 dans lequel le second polypeptide n'est pas un polypeptide de levure.

6. Procédé selon la revendication 3, 4 ou 5 dans lequel le polypeptide eucaryote est situé à l'extrémité amino-terminale,
carboxy-terminale, ou est inséré à l'intérieur de la séquence polypeptidique associée au SIDA.

7. Procédé selon la revendication 1 dans lequel le second polypeptide est un antigène viral.

8. Procédé selon la revendication 7 dans lequel l'antigène viral est un antigène de l'herpès.

9. Procédé selon la revendication 7 dans lequel le second polypeptide est un polypeptide procaryote.

10. Procédé selon l'une quelconque des revendications 1 à 9 dans lequel le polypeptide rétroviral est la gp120 ou la
gp160.

11. Procédé selon l'une quelconque des revendications 1 à 10 dans lequel le second polypeptide est capable d'induire
une réponse autoimmune chez les êtres humains.

12. Procédé selon l'une quelconque des revendications 1 à 10 dans lequel le second polypeptide est incapable
d'induire une réponse autoimmune chez les êtres humains.

13. Procédé selon l'une quelconque des revendications 1 à 12 dans lequel le polypeptide rétroviral est une séquence
E'.

14. Procédé selon l'une quelconque des revendications 1 à 13 qui est non glycosylé.

15. Procédé selon l'une quelconque des revendications 1 à 14 comprenant le mélange du polypeptide de fusion avec
un véhicule pharmaceutiquement acceptable.

Fig. 2k.